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(54) Crystal structure of dipeptidyl peptidase IV 

(57) The present invention relates to crystal struc- 
ture information obtained from crystals of the dipeptidyl- 
peptidase DPP-IV, to methods of preparing such crys- 
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of inhibitors of DPP-IV activity. A further subject matter 
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or design of inhibitor compounds of DPP-IV activity, the 



inhibitor compounds of DPP-IV activity identified by 
these methods and their use in pharmaceutical compo- 
sitions for the treatment and/or prevention of diseases 
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Description 

[0001] The present invention relates to crystal structure information obtained from crystals of the dipeptidyl-peptidase 
DPP-IV, to methods of preparing such crystals, and to their use for the identification and/or design of inhibitors of 
s DPP-IV activity. A further subject matter of the invention are methods for the identification and/or design of inhibitor 
compounds of DPP-IV activity, the inhibitor compounds of DPP-IV activity identified by these methods and their use 
in pharmaceutical compositions for the treatment and/or prevention of diseases comprising diabetes types I and II, IGT 
and obesity. 

[0002] Dipeptidyl peptidase (DPP-IV; T-cell activation antigen CD26 or adenosine binding protein) is a multifunctional 

w type II cell surface glycoprotein. The protein is widely expressed in a variety of cell types, particularly on differential 
epithelial cells of the intestine, liver, prostate tissue, corpus luteum, and kidney proximal tubles (Hartel, S., Gossrau, 
R., Hanski, C. & Reutter, W. (1 988). Dipeptidyl peptidase (DPP) IV in rat organs. Comparison of immunohistochemistry 
and activity histochemistry. Histochemistry 89 , 151-161; McCaughan, G.W., Wickson, J.E., Creswick, P.F. & Gorrell, 
M.D. (1990). Identification of the bile canalicular cell surface molecule GP1 10 as the ectopeptidase dipeptidyl peptidase 

15 IV: an analysis by tissue distribution, purification and N-terminal amino acid sequence. HepatologyW , 534-544) as 
well as leukocyte subsets (Gorrell, M.D., Wickson, J. & McCaughan, G.W. (1991). Expression of the rat CD26 antigen 
(dipeptidyl peptidase IV) on subpopulations of rat lymphocytes. Cell. Immunol. 134, 205-215), such as T-helper lym- 
phocytes, and subsets of macrophages (Buhling, F., Kunz, D., Reinhold, D., Ulmer, A.J., Ernst, M., Flad. H.D. & An- 
sorge, S. (1 994). Expression and functional role of dipeptidyl peptidase IV (CD26) on human natural killer cells. Nat. 

20 Immun. 13, 270-279) and a soluble form is reported to be present in plasma and urine (Iwaki-Egawa, S., Watanabe, 
Y., Kikuya, Y. & Fujimoto, Y. (1 998). Dipeptidyl peptidase IV from human serum: purification, characterization, and N- 
terminal amino acid sequence. J. Biochem. 124, 428-433). Human DPP-IV has a short cytoplasmatic tail of six amino 
acids, a 22 amino acid hydrophobic transmembrane region and a 738 amino acid extracellular domain with ten potential 
glycosylation sites (Tanaka.T, Camerini, D.,Seed : B., Torimoto, Y, Dang, N.H., Kameoka, J., Dahlberg, H.N., Schloss- 

25 man, S.F. & Morimoto, C. (1 992). Cloning and functional expression of the T cell activation antigen CD26. J. Immunol. 
149,481-486). 

[0003] DPP-IV is involved in many biological processes, including a membrane-anchoring function forthe localization 

of the extracellular enzyme adenosine deaminase (ADA) (Franco, R., Valenzuela, A., Lluis, C. & Blanco, J. (1998). 

Enzymatic and extraenzymatic role of ecto-adenosine deaminase in lymphocytes. Immunol. Rev. 161 , 27-42), partic- 
30 ipation in cell matrix adhesion by binding to collagen and fibronectin (Loster, K., Zeilinger, K., Schuppan, D. & Reutter, 

W. (1995). The cysteine-rich region of dipeptidyl peptidase IV (CD 26) is the collagen-binding site. Biochem. Biophys. 

Res. Commun. 217, 341-348), interaction as a co-receptor for the HIV envelope protein gp 120 (Ohtsuki, T, Tsuda, H. 

& Morimoto, C. (2000). Good or evil: CD26 and HIV infection. J. Dermatol. Sci. 22, 152-160) and co-stimulatory function 

during T-cell activation and proliferation (von Bonin, A., Huhn, J. & Fleischer, B. (1998). Dipeptidyl-peptidase IV/CD26 
35 on T cells: analysis of an alternative T-cell activation pathway. Immunol. Rev. 1 61 , 43-53) by interaction with the protein 

tyrosine phosphatase (CD45) (Torimoto, Y, Dang, N.H., Vivier, E.,Tanaka, T, Schlossman, S.F. & Morimoto, C. (1991). 

Coassociation of CD26 (dipeptidyl peptidase IV) with CD45 on the surface of human T lymphocytes. J. Immunol. 147, 

2514-2517). 

[0004] DPP-IV (EC 3.4.14.5) has postproline dipeptidyl amino peptidase activity, preferentially cleaving X-proline or 

40 X-alanine dipeptides from the N-terminus of polypeptides (Hopsu-Havu, V.K. & Glenner, G.G. (1966). A new dipeptide 
naphthylamidase hydrolyzing glycyl-prolyl-beta-naphthylamide. Histochemie 7 , 197-201.) and belongs to the prolyl 
oligopeptidase family, a group of atypical serine proteases able to hydrolyse the prolyl bond (Cunningham, D.F & 
O'Connor, B. (1 997). Proline specific peptidases. Biochim. Biophys. Acta 1343, 160-1 86). It possesses a novel orien- 
tation of its catalytic triad residues (Ser-Asp-His) (Ikehara, Y, Ogata, S. & Misumi, Y. (1994). Dipeptidyl-peptidase IV 

45 from rat liver. Methods Enzymol. 244, 21 5-227.), inverse to that found in classical serine proteases (His-Asp-Ser). The 
cleavage of N-terminal peptides with Pro in the second position is a rate limiting step in the degradation of peptides. 
The natural substrates of DPP-IV include several chemokines, cytokines, neuropeptides, circulating hormones and 
bioactive peptides (Lambeir, A.M., Durinx, O, Proost, P., Van Damme, J., Scharpe, S. & De Meester, I. (2001). Kinetic 
study of the processing by dipeptidyl-peptidase IV/CD26 of neuropeptides involved in pancreatic insulin secretion. 

so FEBS Lett. 507, 327-330.). The wide range of substrates suggests a key regulatory role in the metabolism of peptide 
hormones and in amino acid transport (Hildebrandt, M., Reutter, W., Arck, P., Rose, M. & Klapp, B.F. (2000). A guardian 
angel: the involvement of dipeptidyl peptidase IV in psychoneuroendocrine function, nutrition and immune defence. 
Clin Sc/ 99, 93-104). Its physiological relevance has been investigated by (Hinke, S.A., Pospisilik, J. A., Demuth, H.U., 
Mannhart, S., Kuhn-Wache, K., Hoffmann, T, Nishimura, E., Pederson, R.A. & Mcintosh, C.H. (2000). Dipeptidyl pepti- 

55 <jase IV (DPIV/CD26) degradation of glucagon. Characterization of glucagon degradation products and DPIV-resistant 
analogs. J. Biol. Chem. 275, 3827-3834). 

[0005] The finding that DPP-IV is responsible for more than 95% of the degradation of GLP-1 led to an elevated 
interest in inhibition of this enzyme forthe treatment of diabetes type II. Experiments in rats and humans have provided 
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evidence that specific DPP-IV inhibition increased C max , T 1/2 and total circulating GLP-1 and decreased plasma glu- 
cose. It has been demonstrated that patients with impaired glucose-tolerance (IGT), type-ll diabetes and with a sec- 
ondary failure to respond to sulfonylurea treatment benefit from increased levels of GLP peptides. In addition GLP-1 
is effective in type-l diabetic patients due to its glucagono-static effect. More recent investigations show a delay of 
gastric emptying that could have beneficial effects on satiety and might be relevant for the treatment of obesity. Pro- 
tection of functional GLP-1 by inhibition of DPP-IV and concomitant activation of the GLP-1 receptor might therefore 
have a synergistic potential in anti-diabelic drug research (Hoist, J.J. & Deacon, C.F. (1998). Inhibition of the activity 
of dipeptidyl-peptidase IV as a treatment for type 2 diabetes. Diabetes 47, 1 663-1 670.). Selective and orally available 
small molecule inhibitors of DPP-IV have been discovered and are now in clinical trials (Villhauer, E.B., Brinkman, J. 
A., Naderi, G.B., Dunning, B.E., Mangold, B.L., Mone, M.D., Russell, M.E., Weldon, S,C. & Hughes, T.E. (2002). 
1-[2-[(5-Cyanopyridin-2-yl)amino]ethylamino]acetyl-2-(S)-pyrrolidinecarbon nitrile: a potent, selective, and orally bio- 
available dipeptidyl peptidase IV inhibitor with antihyperglycemic properties. J. Med. Chem. 45, 2362-2365; Pospisilik, 
J.A., Stafford, S.G., Demuth, H.U., Mcintosh, C.H. & Pederson, R.A. (2002). Long-term treatment with dipeptidyl pepti- 
dase IV inhibitor improves hepatic and peripheral insulin sensitivity in the VDF zucker rat: a euglycemic-hyperinsuiine- 
mic clamp study. Diabetes 51 , 2677-2683). 

[0006] Therefore, the present invention provides a solution to the problem of identifying and/br designing inhibitors 
of DPP-IV activity by providing crystals of the extracellular domain of DPP-IV and their crystal structure information, 
methods of preparing such crystals, and methods of identifying and/or designing inhibitors of DPP-IV with these crystals 
by structure based drug design. 

[0007] The present invention relates to crystal structure information obtained from crystalline preparations of the 
dipeptidyl-peptidase DPP-IV. to methods of preparing such crystals, and to their use for the identification and/or design 
of inhibitors of DPP-IV activity. A further subject matter of the invention are methods for the identification and/or design 
of inhibitor compounds of DPP-IV activity, the inhibitor compounds of DPP-IV activity identified by these methods and 
their use in pharmaceutical compositions for the treatment and/or prevention of diseases comprising diabetes types I 
and II, obesity and cancer. 

[0008] Figure 1 . Sequence alignment of DPP-IV and POP: Amino acid sequence alignment of DPP-IV from human 
(hDPP-IV) and rat (rDPP-IV, only different residues are shown). The alignment of POP from pork was performed using 
structural superposition for the ct/p-hydrolase domain only, because of a lack of structural homology forthe p-propeller 
domain. The top line gives additional information about the secondary structure of DPP-IV (yellow arrows and red 
bars), the glycosylation sites with visible electron density (Y), the potential glycosylation sites (marked in red), the 
disulphide bonds (green lines between cysteins that are involved) and an arrow that indicates the start of the cloned 
ectodomain. Sequences are highlighted light gray for the transmembrane part, gray for the part of the (3-propeller 
involved in dimerization, green for residues involved in adenosine deaminase binding, blue for the tyrosine that is 
involved in the stabilization of the oxyanion of the catalytic intermediate and pink for the catalytic residues. 
[0009] Figured. Overall Structure of DPP-IV: Ribbon diagram of DPP-IV viewed perpendicular to the two-fold axis. 
The domains are colored dark green and light green for the oc/p hydrolase and (3-propeller domains of subunit A and 
dark/light blue forthe other subunit, respectively. The overall dimension of the molecule is about 125 x 80 x 60 A 3 . The 
active site is highlighted by the catalytic residues in ball and stick representation as well as residues that are identified 
by mutagenesis data to be important for ADA binding. The proposed location at the cell surface is shown by the sche- 
matic drawing of the membrane. This figure was prepared using Molscript (Kraulis, P.J. (1 991 ). MOLSCRI PT: A program 
to produce both detailed and schematic plots of protein structures. J. Applied Crystallogr. 24, 946-950) and rendered 
with Raster3D (Merrit, E.A. & Bacon, D.J. (1997). Raster3D: photorealistic molecular graphics. Methods Enzymol 277 
505-524). 

[0010] Figure 3. Ribbon drawing of the p-propeller domains of DPP-IV and POP: A: DPP-IV has 8 repeats of a 
structural motif that consists of four antiparallel (3-strands orblades (blades are numbered 1 to 8). Additional secondary 
structural elements are colored magenta: An antiparallel p-sheet (P2/2a and P2/2b in Figure 1) that is an extension of 
blade 2 with Arg125 at the tip of the turn that is involved in the substrate binding. An a-helix (a2* in Figure 1) with the 
C-terminal glutamate rich loop that contributes to substrate recognition and specificity (Glu204/205/206). The antipar- 
allel p-sheet that forms a main part of the dimer interface (p1* and p2* in Figure 1). The latter structural elements are 
extensions of the blade 4. 
B: p-propeller domain of DPP-IV rotated 90° 

C: POP has 7 blades and no notable deviations from the p-propeller structure. The blades are numbered 1 to 7. 
[0011] Figure 4. Access to the active site: Schematic view on the subunit of DPP-IV with the active site surface 
coloured according to the atom types. The substrate Diprotin A is shown with white carbons indicating the substrate 
binding site. Arrows illustrate that the substrate may enter the active site at the well accessible and open active site 
cleft and the dipeptidic product of the catalytic reaction may leave the active site cavity via the more narrow tunnel that 
is formed by the p-propeller. 

[0012] Figure 5. Active site of DPP-IV with Diprotin A (lle-Pro-lle): The substrate Diprotin A is trapped as tetrahedral 
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intermediate covalently bound to the active site Ser630. Dashed lines indicate hydrogen bonds. Bonds are dark blue 
for the protein and light blue for the ligand as well as the active site Ser630. Drawn with MOLOC (Gerber, P.R. (1992). 
Peptide mechanics: a force field for peptides and proteins working with entire residues as small unites. Biopolymers 
32, 1003-1 01 7).The insert shows the omit electron density (ligand and Ser630 were omitted from the calculations) 
contoured at 2.5 a (green) and 4 o (yellow). 

[0013] The present invention relates to crystals of mammalian DPP-IV, with or without a ligand bound in the active 
site, where the crystals are of sufficient quality and size to allow lor the determination of the three-dimensional X-ray 
diffraction at atomic resolution. The invention also relates to methods for producing and crystallizing the mammalian 
DPP-IV. The crystals of mammalian DPP-IV, as well as information derived from their crystal structures can be used 
to analyze and modify mammalian DPP-IV activity as well as to identify compounds that interact with DPP-IV. 
[0014] In one. aspect the present invention provides a crystal of the extracellular domain of mammalian DPP-IV, 
preferably having the orthorhombic space group symmetry P2 1 2 1 2 1 and one homodimer of DPP-IV in the asymmetric 
unit. Preferably, the crystal includes a unit cell having dimensions a, b, and c; wherein a is from 63A to 67A, b is from 
66 A to 70 A, and c is from 41 6 A to 424 A; and a = (i = y = 90°. Preferably, the crystal includes atoms arranged in a 
spatial relationship represented by the atomic structure coordinates listed in Table 4. Preferably, the crystal includes 
DPP-IV comprising the amino acid sequence from Gly31 to Pro766 of the native protein as well as shorter variants 
thereof comprising all amino acids necessary for forming the active site. Preferably, the crystal includes DPP-IV as set 
forth in S EQ ID NO:2 as well as shorter variants thereof comprising all amino acids necessary for forming the active site. 
[0015] The crystals of the invention include apo crystals and co-crystals. The apo crystals of the invention refer to 
crystals of mammalian DPP-IV formed without a bound active site or allosteric ligand. The co-crystals generally com- 
prise DPP-IV with a ligand bound to the active site or to an allosteric site. The "active site" refers in general to the site 
where the enzymatic reaction catalyzed by the enzyme takes place. An active site ligand refers to any compound which 
specifically binds to the active site of a mammalian DPP-IV. 

[001 6] Preferably, the co-crystal of the present invention is characterized as having an orthorhombic space group of 
P2 1 2 1 2 1 (space group No. 1 9) and one homodimer of DPP-IV in the asymmetric unit. 

[00171 More preferably, the co-crystal has unit cell dimensions of a is from 63 A to 67A, b is from 66 A to 70 A, and 
c is from 416 A to 424 A.; and a = p = y = 90° and a P2-,2 1 2 1 symmetry. 

[0018] The co-crystals of the invention generally comprise a crystalline DPP-IV polypeptide in association with one 
or more compounds at an active or allosteric binding site of the polypeptide. The association may be covalent or non- 
covalent. 

[0019] The DPP-IV (dipeptidyl-peptidase, DPP-IV; T-cell activation antigen CD26 or adenosine binding protein) of 
the present invention may be a mammalian DPP-IV. Preferably, the DPP-IV of the present invention is a human DPP-IV. 
More preferably, the DPP-IV of the present invention is the extracellular domain of DPP-IV. Even more preferred is the 
extracellular domain of DPP-IV which is soluble. Most preferably, the human DPP-IV comprises the amino acid se- 
quence from Gly31 to Pro766 of the native protein as well as shorter variants thereof comprising all amino acids nec- 
essary for forming the active site. Preferably, DPP-IV comprises the amino acid sequence as set forth in SEQ. ID NO: 
2 as well as shorter variants thereof comprising all amino acids necessary for forming the active site. 
[0020] It is to be understood that the crystals of DPP-IV of the invention are not limited to naturally occurring or native 
DPP-IV. Indeed, the crystals of the invention include mutants of the native DPP-IV. Mutants of native DPP-IV are 
obtained by replacing at least one amino acid residue in a native DPP-IV domain with a different amino acid residue, 
or by adding or deleting amino acid residues within the native polypeptide or at the N- or C- terminus of the native 
polypeptide, and have substantially the same three-dimensional structure as the native DPP-IV from which the mutant 
is derived. 

[0021] By having substantially the same three-dimensional structure is meant having a set of atomic structure coor- 
dinates from an apo- or co-crystal that have a root mean square deviation of less than or equal to about 1 .5 A when 
superimposed with the atomic structure coordinates of the native DPP-IV when at least 50% of the alpha carbon atoms 
of DPP-IV are included in the superposition. 

[0022] In some instances, it may be particularly advantageous or convenient to substitute, delete and/or add amino 
acid residues to a native DPP-IV domain in order to provide convenient cloning sites in cDNA encoding the polypeptide, 
to aid in purification of the polypeptide, etc. Such substitutions, deletions and/or additions which do not substantially 
alter the three dimensional structure of the native DPP-IV will be apparent to those having skills in the art. 
[0023] It should be noted that the mutants contemplated herein need not exhibit DPP-IV activity. Indeed, amino acid 
substitutions, additions or deletions that interfere with the peptidase activity of the DPP-IV but which do not significantly 
alter the three-dimensional structure of the domain are specifically contemplated by the invention. Such crystalline 
polypeptides, or the atomic structure coordinates obtained therefrom, can be used to identify compounds that bind to 
the native domain. These compounds may affect the activity or the native domain. 

[0024] The derivative crystals of the invention generally comprise a crystalline DPP-IV polypeptide in covalent as- 
sociation with one or more heavy metal atoms. The polypeptide may correspond to a native or a mutated DPP-IV. 
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Heavy metal atoms useful for providing derivative crystals include, by way of example and not limitation, gold and 
mercury. Alternatively, derivative crystals can be formed from proteins which have heavy atoms incorporated into one 
or more amino acids, such as seleno-methionine substitutions for methionine. 

[0025] Therefore, in a preferred embodiment of the present invention the co-crystal is a co-crystal of the extracellular 
domain of mammalian DPP-IV and HgCI 2 . 

[0026] The native and mutated DPP-IV polypeptides described herein may be isolated from natural sources or pro- 
duced by methods well known to those skilled in the art of molecular biology. Expression vectors to be used may contain 
a native or mutated DPP-IV polypeptide coding sequence and appropriate transcriptional and/or translational control 
signals. These methods include in vitro recombinant DNA techniques, synthetic techniques and in vivo recombination/ 
genetic recombination. See, for example, the techniques described in Maniatis et al., 1 989, Molecular Cloning: A Lab- 
oratory Manual, Cold Spring Harbor Laboratory, NY; and Ausubel et al. , 1 989, Current Protocols in Molecular Biology, 
Greene Publishing Associates and Wiley Interscience, NY. 

[0027] A variety of host-expression vector systems may be utilized to express the DPP-IV coding sequence. These 
include but are not limited to microorganisms such as bacteria transformed with recombinant bacteriophage DNA, 
plasmid DNA or cosmid DNA expression vectors containing the DPP-IV coding sequence; yeast transformed with 
recombinant yeast expression vectors containing the DPP-IV coding sequence; insect cell systems infected with re- 
combinant virus expression vectors (e.g. baculovirus) containing the DPP-IV coding sequence; plant cell systems 
infected with recombinant virus expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosiac virus, TMV) 
or transformed with recombinant plasmid expression vectors (e.g., Ti plasmid) containing the DPP-IV coding sequence; 
or animal cell systems. The expression elements of these systems vary in their strength and specificities. Depending 
on the host/vector system utilized, any of a number of suitable transcription and translation elements, including con- 
stitutive and inducible promoters such as P L of bacteriophage u, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like 
may be used; when cloning in insect cell systems, promoters such as the baculovirus polyhedrin promoter may be 
used; when cloning in plant cell systems, promoters derived from the genome of plant cells (e.g., heat shock promoters; 
the promoter for the small subunit of RUBISCO; the promoter for the chlorophyll a/b binding protein) or from plant 
viruses (e.g., the 35 S RNA promoter of CaMV; the coat protein promoter of TMV) may be used; when cloning in 
mammalian cell systems, promoters derived from the genome of mammalian cells (e.g., metallothionein promoter) or 
from mammalian viruses (e.g., the adenovirus late promoter; the vaccinia virus 7.5K promoter) may be used; when 
generating cell lines that contain multiple copies of the DPP-IV coding sequence, SV40-, BPV- and EBV-based vectors 
may be used with an appropriate selectable marker. 

[0028] In a preferred embodiment of the present invention, an isolated nucleic acid sequence encoding the soluble 
extracellular domain of DPP-IV comprising the nucleotide sequence of SEQ ID NO:1 is provided. 
[0029] Additionally, an expression vector containing an isolated nucleic acid sequence encoding the soluble extra- 
cellular domain of DPP-IV comprising the nucleotide sequence of SEQ ID NO:1 is provided. Preferably, the expression 
vector for the expression of proteins in P. pastoris which are to be secreted. Furthermore, a host cell transformed with 
the said expression vector is provided. Preferably, the host cell is Pichia pastoris. 

[0030] A further aspect of the present invention relates to a method of producing the soluble extracellular domain of 
DPP-IV comprising culturing the host cell with the said expression vector under conditions permitting the expression 
of the soluble extracellular domain of DPP-IV by the host cell. Preferably, the host cell is P. pastoris. The present 
invention also provides the soluble extracellular domain of DPP-IV produced by this method. 

[0031] Furthermore, the present invention relates to a polypeptide comprising the soluble extracellular domain of 
DPP-IV as set forth in SEQ ID NO:2. 

[0032] The apo-, derivative and co-crystals of the invention can be obtained by techniques well-known in the art of 
protein crystallography, including batch, liquid bridge, dialysis, vapor diffusion and hanging drop methods (see e.g. 
McPherson, 1982, Preparation and Analysis of Protein Crystals, John Wiley, NY; McPherson, 1990, Eur. J. Biochem 
189:1-23; Webber, 1 991 , Adv. Protein Chem. 41 : 1-36; Crystallization of Nucleic Acids and Proteins, Edited by Amaud 
Ducruix and Richard Giege, Oxford University Press; Protein Crystallization Techniques, Strategies, and Tips, Edited 
byTerese Bergfors, International University Line, 1999). Generally, the apo- or co-crystals of the invention are grown 
by placing a substantially pure DPP-IV polypeptide in an aqueous buffer containing a precipitant at a concentration 
just below that necessary to precipitate the protein. Water is then removed from the solution by controlled evaporation 
to produce crystallizing conditions, which are maintained until crystal growth ceases. 

[0033] Preferably, the crystals are produced by a method for crystallizing mammalian DPP-IV, the method comprising 
(a) providing a buffered, aqueous solution of pH 7 to 8.5 with a concentration of 7 mg/ml to 22 mg/ml of the extracellular 
domain of mammalian DPP-IV; and (b) growing crystals by vapor diffusion using a buffered reservoir solution with 
between 10% and 30% PEG, between 1 0% and 20% glycerol, wherein PEG has an average molecular weight between 
1 000 and 20000. More preferably, the extracellular domain of mammalian DPP-IV of step (a) of the method is produced 
in the yeast Pichia pastoris (P. pastoris) and then deglycosylated. For deglycosylation, different enzymes may be used 
comprising Endoglycosidase F or PNGase. 
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[0034] Preferably, co-crystals are produced by a method for co-crystallizing mammalian DPP-IV and an active site 
ligand, the method comprising (a) providing a buffered, aqueous solution of pH 7 to 8.5 with a concentration of 7 mg/ 
ml to 22 mg/ml of the extracellular domain of mammalian DPP-IV; (b) adding a molar excess of the active site ligand 
to the aqueous solution of mammalian DPP-IV; (c) growing crystals by vapor diffusion using a buffered reservoir solution 
5 with between 10% and 30% PEG, between 10% and 20% glycerol, wherein PEG has an average molecular weight 
between 1000 and 20000. More preferably, the extracellular domain of mammalian DPP-IV of step (a) of the method 
is produced in P. pastoris and then deglycosylated. 

[0035] A further aspect of the present invention relates to a crystal produced by the methods for crystallizing or co- 
crystallizing DPP-IV of the present invention. 
w [0036] Crystals may be frozen prior to data collection. 

[0037] The mosaic spread of the frozen crystals could sometimes be reduced by annealing, wherein the stream of 
cold nitrogen gas is briefly blocked, allowing the frozen crystal to thaw momentarily before re-freezing in the nitrogen 
gas stream. 

[0038] Diffraction data typically extending to 2.7 A was collected from the frozen crystals at the synchrotron beamline 
15 x06 at the Swiss light source (SLS), Villigen Switzerland. Under optimum conditions, data extending to 2.1 A was 
recorded. Preferably, the the data is collected at a resolution of 3.5 A to 2.1 A or better. More preferably, the data is 
collected at a resolution of 2.7 A to 2.1 A or better. 

[0039] Derivative crystals of the invention can be obtained by soaking apo or co-crystals in mother liquor containing 
salts of heavy metal atoms, according to procedures known to those of skill in the art of X-ray crystallography. 
20 [0040] Co-crystals of the invention can be obtained by soaking an apo crystal in mother liquor containing a ligand 
that binds to the active site, or can be obtained by co-crystallizing the DPP-IV polypeptide in the presence of one or 
more ligands that bind to the active site or to an allosteric site. Preferably co-crystals are formed with an active site 
DPP-IV ligand which is slowly hydrolysable and forms a covalent bond. One example for such an active site ligand is 
Diprotin A. 

25 [0041] In a further embodiment of the present invention a method for determining the three-dimensional structure of 
a crystallized extracellular domain of mammalian DPP-IV to a resolution of 3.5 A to 2.1 A or better is provided, the 
method comprising 



(a) crystallizing an extracellular domain of mammalian DPP-IV; and 
30 (b) analyzing the extracellular domain of mammalian DPP-IV by X-ray diffraction to determine the three-dimen- 

sional structure of the crystallized extracellular domain of mammalian DPP-IV, whereby the three-dimensional 
structure of a crystallized extracellular domain of mammalian DPP-IV is determined to a resolution of about 3.5 A 
to 2.1 A or better. 



35 [0042] The present invention further relates to a machine-readable data storage medium comprising a data storage 
material encoded with machine readable data which, when using a machine programmed with instructions for using 
said data, displays a graphical three-dimensional representation of a molecule or molecular complex comprising at 
least a portion of the extracellular domain of mammalian DPP-IV comprising the amino acids of SEQ ID NO:2, the 
extracellular domain comprising the ligand binding active site being defined by a set of points having a root mean 

40 square deviation of less than about 1 .5A from points representing the backbone atoms of said amino acids as repre- 
sented by structure coordinates listed in Table 4. 

[0043] The crystals of the invention, and particularly the atomic structure coordinates obtained therefrom, have a 
wide variety of uses. For example, the crystals and structure coordinates described herein are particularly useful for 
identifying compounds that interact with DPP-IV as an approach towards developing new therapeutic agents. Phar- 
45 maceutical compositions of said compounds can be developed, and said compounds can be used for the manufacture 
of a medicament comprising said compound for the treatment of IGT, type I and type II diabetes, obesity and cancer. 
[0044] Therefore, the present invention also relates to the use of a crystal or a co-crystal of the invention for the 
identification and/or design of inhibitors of DPP-IV activity. 

[0045] Moreover, the present invention relates to a method for identifying a compound that interacts with DPP-IV, 
so comprising the steps of 

(a) generating a three-dimensional model of DPP-IV using the structure coordinates listed in Table 4, a root mean 
square deviation from the backbone atoms of said amino acids of less than 1 .5A; and 

(b) employing said three-dimensional model to design or select a compound that interacts with DPP-IV. 
55 in another aspect, the method further comprises the steps of 

(c) obtaining the identified compound; and 

(d) contacting the obtained compound with DPP-IV in order to determine the effect the compound has on DPP-IV 
activity. 
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[0046] The compound in these methods may be a compound that interacts with the active site of DPP-IV or may be 
a compound that interacts with an allosteric site of DPP-IV. Preferred are compounds which interact with the active 
site of DPP-IV. Even more preferred are compounds, which show an inhibitory effect on DPP-IV activity in step (d) of 
the methods of the present invention. 

[0047] In a further aspect of the present invention the method for identifying a compound that interacts with DPP-IV 
is a computer-assisted method. Preferably, determining whether the compound is expected to bind to or interfere with 
the molecule or molecular complex includes performing a fitting operation between the compound and a binding site 
or substrate binding surface of the molecule or molecular complex, followed by computationally analyzing the results 
of the fitting operation to quantify the association between, or the interference with, the compound and the binding site. 
Optionally, the method further includes screening a library of compound. Optionally, the method further includes sup- 
plying or synthesizing the compound, then assaying the compound to determine whether it interacts with and has an 
effect on mammalian DPP-IV activity. 

[0048] The present invention also relates to the compounds identified by the said methods for identifying a compound 
that interacts with DPP-IV. 

[0049] The structure coordinates described herein can be used as phasing models in determining the crystal struc- 
tures of additional native or mutated DPP-IV, as well as the structures of co-crystals of such DPP-IV with active site 
inhibitors oractivators bound. The structure coordinates, as well as models of the three-dimensional structures obtained 
therefrom, can also be used to aid the elucidation of solution-based structures of native or mutated DPP-IVs, such as 
those obtained via NMR. Thus, the crystals and atomic structure coordinates of the invention provide a convenient 
means for elucidating the structures and functions of DPP-IV or other prolyl oligopeptidases. 

[0050] For purposes of clarity and discussion, the crystals of the invention will be described by reference to specific 
DPP-IV exemplary apo crystals and co-crystals. Those skilled in the art will appreciate that the principles described 
herein are generally applicable to crystals of any mammalian DPP-IV, including, but not limited to DPP-IV. 
[0051] Increased levels of glucagon like peptide 1 (GLP1 ) are beneficial for the decrease of plasma glucose in hu- 
mans. The finding that DPP-IV is responsible for more than 95% of the degradation of GLP-1 led to an elevated interest 
in inhibition of this enzyme forthe treatment of diabetes type II. Experiments in rats and humans have provided evidence 
that specific DPP-IV inhibition increased C max , T 1/2 and total circulating GLP-1 and decreased plasma glucose. It has 
been demonstrated that patients with impaired glucose-tolerance (IGT), type-ll diabetes and with a secondary failure 
to respond to sulfonylurea treatment benefit from increased levels of GLP1 peptides. In addition GLP-1 is effective in 
type-l diabetic patients due to its glucagono-static effect. More recent investigations show a delay of gastric emptying 
that could have beneficial effects on satiety and might be relevant forthe treatment of obesity. Protection of functional 
GLP-1 by inhibition of DPP-IV and concomitant activation of the GLP-1 receptor might therefore have a synergistic 
potential in anti-diabetic drug research (Hoist, J.J. & Deacon, C.F. (1998). Inhibition of the activity of dipeptidyl-peptidase 
IV as a treatment for type 2 diabetes. Diabetes 47, 1 663-1 670). Selective and orally available small molecule inhibitors 
of DPP-IV have been discovered and are now in clinical trials. 

[0052] Therefore, in a further aspect of the present invention a pharmaceutical composition comprising thecompound 
identified by the methods of the present invention as having an effect on DPP-IV activity, or pharmaceutically acceptable 
salts thereof, and a pharmaceutically acceptable carrier is provided. 

[0053] The phrase "pharmaceuticaily acceptable" is employed herein to refer to those compounds, materials, com- 
positions, and/or dosage forms which are, within the scope of sound medical judgment, suitable for use in contact with 
the tissues of human beings and animals without excessive toxicity, irritation, allergic response, or other problem or 
complication, commensurate with a reasonable benefit/risk ratio. 

[0054] As used herein, "pharmaceutically acceptable salts" refer to derivatives of the disclosed compounds wherein 
the parent compound is modified by making acid or base salts thereof. Examples of pharmaceutically acceptable salts 
include, but are not limited to, mineral or organic acid salts of basic residues such as amines; alkali or organic salts of 
acidic residues such as carboxylic acids; and the like. The pharmaceutically acceptable salts include the conventional 
non-toxic salts or the quaternary ammonium salts of the parent compound formed, for example, from non-toxic inorganic 
or organic acids. For example, such conventional non-toxic salts include those derived from inorganic acids such as 
hydrochloric, hydrobromic, sulfuric, sulfamic, phosphoric, nitric and the like; and the salts prepared from organic acids 
such as acetic, propionic, succinic, glycolic, stearic, lactic, malic, tartaric, citric, ascorbic, pamoic, maleic, hydroxy- 
maleic, phenylacetic, glutamic, benzoic, salicylic, sulfanilic, 2-acetoxybenzoic, fumaric, benzenesulfonic, toluenesul- 
fonic, methanesulfonic, ethane disulfonic, oxalic, isethionic, and the like. 

[0055] The pharmaceutically acceptable salts of the present invention can be synthesized from the parent compound 
which contains a basic or acidic moiety by conventional chemical methods. Generally, such salts can be prepared by 
reacting the free acid or base forms of these compounds with a stoichiometric amount of the appropriate base or acid 
in water or in an organic solvent, or in a mixture of the two; generally, nonaqueous media like ether, ethyl acetate 
ethanol, isopropanol, or acetonitrile are preferred. Lists of suitable salts are found in Remington's Pharmaceutical 
Sciences, 17th ed., Mack Publishing Company, Easton, PA, 1985, p. 1418, the disclosure of which is hereby incorpo- 
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rated by reference. 

[0056] "Stable compound" and "stable structure" are meant to indicate a compound that is sufficiently robust to 
survive isolation to a useful degree of purity from a reaction mixture, and formulation into an efficacious therapeutic 
agent. 

s [0057] Furthermore, a compound identified by the methods of the present invention as having an effect on DPP-IV 
activity for use as a therapeutic active substance, in particular for the treatment of diabetes type I, diabetes type II, 
IGT, obesity and cancer, is provided. 

[0058] A further aspect of the present invention relates to the use of a compound identified by the methods of the 
present invention as having an effect on DPP-IV activity for the manufacture of a medicament for the treatment of 
10 diabetes type-l, diabetes type-ll, IG, obesity, and cancer. 

[0059] Having now generally described this invention, the same will become better understood by reference to the 
specific examples, which are included herein for purpose of illustration only and are not intended to be limiting unless 
otherwise specified, in connection with the following figures. 

15 Examples 

[0060] Commercially available reagents referred to in the examples were used according to manufacturer's instruc- 
tions unless otherwise indicated. 

20 Example 1 

DNA manipulation and sequence analysis 

[0061] Preparation of DNA probes, digestion with restriction endonucleases, DNA ligation and transformation of E. 
25 coli strains were performed as described (Sambrook, J., Fritsch, E.F. & Maniatis, T. (1 989). Molecular Cloning: A Lab- 
oratory Manual. Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY.). For DNA sequencing, the ABI PRISM 
BigDye Terminator Cycle Sequencing Ready Reaction Kit and ABI PRISM 310 Genetic analyzer were used. PCRwere 
performed in the T3 Thermocycler (Whatman Biometra), using the Pfu polymerase (Stratagene). 

30 Production and Purification of recombinant human sDPP-IV in P. pastoris 

[0062] The ectodomain of DPP-IV, residues 31 -766 (sDPP-IV), was amplified by PCR using a cDNA and the oligo- 
nucleotides 5'-TGCTGGAATTCGGCACAGATGATGCTAC-3 , (with an EcoRI site in bold) and 5'-GCA TGG TAC CTT 
GAG GTG CTA AG -3' (with a Kpnl site in bold). Using the two new restriction sites, the amplified DNA fragment (SEQ 

35 ID NO:1) was cloned into pPICZct-A vector (Invitrogen) to create a fusion with the a-mating factor signal sequence for 
the secretion of the protein. The use of the EcoRI restriction site added the amino acids glutamine and phenylalanine 
to the N-terminus of sDPP-IV. The sequence was confirmed by sequencing. pPICZa-sDPP-l V was linearized with Sacl , 
transformed by electroporation in P. pastoris strain GS115 and the phenotype of the colonies obtained was checked 
as recommended by the distributor Invitrogen. 

40 [0063] Eight transformants with phenotype MutS were screened for the expression of DPP-IV. Colonies were grown 
at 30°C in YPD medium (1% yeast extract, 2% peptone, 2% glucose) with zeocin (100 n.g/ml) to an OD 600 of 8-10. 
Cells were collected by centrifugation and resuspended in YP medium plus 2% methanol. The same amount of meth- 
anol was added every 24 h. After 48 h the medium of each clone was tested for activity (see below). sDPP-IV was 
then produced in a large scale culture using the transformed cell line with the highest activity per volume as described 

is (Dale, G.E., D'Arcy B., Yuvaniyama, C, Wipf, B., Oefner, C. & D'Arcy, A. (2000). Purification and crystallization of the 
extracellular domain of human neutral endopeptidase (neprilysin) expressed in Pichia pastoris. Acta Crystallogr. D56, 
894-897). 

[0064] Ten liters of the collected sDPP-IV supernatant of the selected transformed P. pastoris cell line was filtered 
and concentrated to 180 ml by crossflow ultrafiltration (skannette) using a 30 kDA filtration module (AGT Technology 

so corporation). The concentrate was passed over a Sephacryl 200 XK 50/1 00 size exclusion column (5 x 95 cm, Phar- 
macia) equilibrated with 50 mM Tris-HCI pH 7.8 and 1 00 mM NaCI (S-buffer). Collected fractions were screened on 
SDS-PAGE and for activity. Fractions containing sDPP-IV were diaiysed against 50 mM Tris-HCI pH 7.9. The protein 
solution was loaded on a Fractogel-TMAE column (2.6 x 13 cm, Merck) equilibrated with 50 mM Tris-HCI pH 7.9, 
washed with two column volumes of the same buffer and eluted with 500 ml of a linear gradient from 0 to 200 mM 

55 NaCI. Fractions containing sDPP-IV were diaiysed against 20 mM sodium acetat pH 4.8. The protein solution was 
loaded on a Fractogel-COO" column (1x12 cm, Merck) equilibrated with the same buffer and washed with two column 
volumes of this buffer. Bound proteins were eluted with 200 ml of a linear gradient from 50 to 500 M NaCI. The elution 
profile showed a major peak at 250 mM NaCI. Preparation of enzymatically deglycosylated sDPP-IV (sDPPIV deglycos ) 
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was carried out prior to loading on the last gelfiltration column. 0.1% EndoF1-GST was added to the pooled fractions 
of DPP-IV and incubated for 20 h at 21 °C. The concentrated protein solution was loaded on a Biosec size exclusion 
column (1 .6 x 60 cm, Merck), that was equilibrated with S-buffer. Fractions were analyzed by SDS-PAGE, showing a 
purity > 95%. N-terminal sequencing showed that the protein was efficiently processed by the STE1 3 signal peptidase 
which cleaves off the a-mating factor. Preparation of the sDPPIV deg(yco :ADA-complex was performed by addition of a 
two times excess of ADA (Sigma Type IV, from calf intestinal Mucosa) and purification using a Biosec-size exclusion 
column. 

[0065] The soluble extracellular domain of human dipeptidyl peptidase IV (sDPP-l V; residues 31 -766) was expressed 
in the yeast Pichia pastoris. The protein was secreted at the low level of 1 mg/l as estimated from the total activity. As 
a first purification step the concentrated protein was passed through a size-exclusion column which removed the main 
fraction of contaminating peptides from the yeast-peptone medium. Sequential chromatography on anion- and cation- 
exchanger and a second size exclusion chromatography were used to get protein of 95% purity as judged by 
SDS-PAGE. The yield of pure protein was 0.3 mg/l growth medium. The purified protein shows essentially identical 
kinetic parameters and inhibition constants for known inhibitors of DPP-IV to those reported for the enzyme purified 
from human serum (Tables 1 and 2). 

Analytical methods 

[0066] Purification of sDPP-l V was followed by electrophoresis on 1 0-20% Tricine SDS polyacrylamide gradient gels 
(Lammli, U.K. (1970). Cleavage of structural proteins during assembly of the head of bacteriophage T4. Nature 227, 
680-685). Protein concentrations were determined according to Bradford (Bradford, M.M. (1 976). A rapid and sensitive 
method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding. Anal. Bio- 
chem. 72, 248-254) orfor pure protein by absorption spectroscopy using the calculated molecular extinction coefficient 
at 280 nm of 193'920 M-icm-1 (A 280 °.1% = 2.27cm2/mg; Pace, C.N., Vajdos, F., Fee, L, Grimsley, G. & Gray, T. (1995). 
How to measure and predict the molar absorption coefficient of a protein. Protein Sc/4, 2411-2423). Analytical gel 
filtration chromatography was performed on a Superdex 200 12 HP. 10/30 column (Pharmacia) equilibrated with S- 
buffer. The eluate was monitored with a miniDAWN multi-angle laser light scattering detector (Wyatt) and a refractive 
index-detector (Shodex), which allows the determination of the molecular weight and dispersity over the elution peak 
(Wyatt, P.J. (1993). Light scattering and the absolute characterisation of macromolecules. Analytica Chimica Acta 272, 
1-40). Sedimentation equilibrium runs in a Beckman analytical ultracentrifuge (model Optima XL A) were performed 
at 20°C and 9000 rpm sDPP-IV deg!ycos and at 7000 rpm for sDPP-IV deglycos :ADA-complex. The initial protein concen- 
trations were 0.22 to 0.25 mg/ml in S-buffer. The absorption was followed at 280 nm. Assumed partial specific volumes 
for sDPP-IV of 0.729 cm 3 /g and ADA of 0.735 cm3/g were used to determine the molecular masses. 
[0067] Free sulfhydryl groups were determined according the procedure described by Ellman (Ellman, G.L. (1 959). 
Tissue sulfhydryl groups. Arch. Biochem. Biophys. 82, 70-77) under denaturing conditions (0.3% SDS in 50 mM Tris 
pH 8.0). 

Thermostability measurements 

[0068] The irreversible loss of activity after incubation at various temperatures was used as an operational criterion 
of the thermostability of sDPP-IV. Kinetics of irreversible heat inactivation were performed as described by Sterner et 
al. (Sterner, R., Kleemann, G.R., Szadkowski, H., Lustig, A., Hennig, M. & Kirschner, K. (1996). Phosphoribosyl an- 
thranilate isomerase from Thermotoga maritima is an extremely stable and active homodimer. Protein Sci. 5, 
2000-2008) with a final protein concentration of 20 u.g/ml in 50 mM potassium phosphate buffer at pH 7.5, containing 
100 mM NaCI. The residual activity was determined by recording the initial velocity at 25°C of the enzyme-catalyzed 
reaction (see below) and the averaged values obtained were plotted against the incubation temperature. 



[0069] DPP-IV was immobilized on a CM5 surface piasmon resonance sensor (Biacore) using standard amide cou- 
pling chemistry. The organic adlayer on this sensor type consists of carboxymethylated dextran (MW =1 00 kDA). After 
activation of the carboxylic acid groups using carbodiimide/N-hydroxysuccinimide solutions, the surface was contacted 
with a DPP-IV solution (80 u.l) containing = 1 00 u.g/ml protein in acetate buffer (1 0 mM, pH 4.5). The amount immobilized 
corresponded to a sensor response of roughly 1 0 000 P.U. The surfaces of two flow cells were modified with protein. 
To suppress baseline drift - possibly due to slow dimer dissociation - the protein of one cell was cross-linked by short 
contact with carbodiimide/N-hydroxysuccinimide solution. This treatment did not influence the protein activity since 
binding constants determined with cross - linked protein were similar to those determined with non-cross-linked protein. 
Hepes buffer (0.01 M Hepes, pH 7.4, 0.15 M NaCI, 3 mM EDTA, 0.005% polysorbate 20 (v/v)) was used as the running 
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buffer. Diprotin-A was disolved directly in this buffer. NVP-DPP728 was first dissolved in pure DMSO and then diluted 
into running buffer. The final inhibitor solution contained less than 0.1% DMSO. Binding experiments were carried out 
by contacting the immobilized protein surfaces with inhibitor solutions of varying concentrations at a flow rate of 1 0 u.l/ 
min or 30 u.l/min. After each contact with inhibitor, the protein surfaces were regenerated by extensively washing with 
5 running buffer. 

Activity assay 

[0070] The activity assay is based on the increase of fluorescence of products compared to the substrate Ala-Pro- 
7-amido-4-trifluoromethylcoumarin (Calbiochem, Smith, R.E., Reynolds, C.J. & Elder, E.A. (1992). The evolution of 
proteinase substrates with special reference to dipeptidylpeptidase IV. Histochem. J. 24, 637-647). A 20 mM stock 
solution in 10 % DMF is stored at -20°C until use. Purification was followed by using a final substrate concentration of 
50 u.M and for the determination of kinetic parameters it was varied between 1 .5 n.M and 500 uM in the assay. DPP-IV 
activity assays were performed in 96 well plates in a total assay volume of 100 uJ. The assay buffer consists of S-Buffer 
containing 0.1 mg/ml BSA. Fluorescence is detected in a Luminescence Spectrometer LS 50B (Perkin Elmer) at an 
excitation wavelength of 400 nm and an emission wavelength of 505 nm. Initial rate constants are calculated by best 
fit linear regression. 

Example 2 

20 

Crystallization and Structure determination 

[0071] For crystallization trials, sDPP-IV deglycos was concentrated to approximately 10 mg/ml. A reduced factorial 
screen was carried out using the vapour diffusion method. Crystals were obtained with 20-25% PEG 3350, 200 mM 
MgCI 2 , Tris pH 8.5 and 15% glycerol. The crystals were flash-frozen in liquid nitrogen and exhibit the orthorhombic 
space group P2 1 2 1 2 1 with cell dimensions of about 65 A, 68 A and 420 A and one dimer per asymmetric unit. They 
diffract to a maximum of 2.3 A resolution using synchrotron radiation and show rather high mosaicity (0.5-1 .2°). Addition 
of 1 mM Diprotin-A prior to crystallization led to crystals of the complex. The mercury derivative was produced by 
cocrystallization with 0.1 mM HgCI 2 . 

[0072] Data collection was performed using synchrotron radiation (Swiss light source, SLS Villigen, Switzerland and 
ID14, ESRF Grenoble, France) as well as in-house facilities (search for heavy atom derivatives, evaluation of crystal 
quaiity) and processed with DENZO (Otwinowski, Z. (1 993). Oscillation data reduction program. In Proceedings of the 
CCP4 Study Weekend: Data Collection and Processing (Wawyey, L, Isaacs, N. & Bailey, S., eds.). pp. 56-62, SERC 
Daresbury Laboratory, UK). Details of the data collection statistics are given in Table 3. All programs used are part of 
the CCP4 (CCP4 (Collaborative Computational Project, Number 4) (1994). The CCP4 suite: programs for protein crys- 
tallography. Acta Crystallogr. D, 760-763) suite, except where indicated. The structure was determined by multiwave- 
length anomalous dispersion (MAD) of the mercury derivative. One major mercury binding site per subunit (Cys 551 , 
one of the two free SH-groups Cys301 and Cys551 that are located near the active site) was identified by inspection 
of the difference patterson maps calculated from the peak wavelength data and was subsequently refined using SHARP 
(De la Fortelle, E. & Bricogne, G. (1 997). Maximum likelihood heavy-atom parameter refinement for multiple isomorphus 
replacement and multiwavelength anomalous diffraction methods. Methods Enzymol. 276, 472-494). Location of the 
twofold non-crystallographic axis was performed using this mercury site and the program find2folds (Dunten, P. & 
Hennig, M. (2002). Locating non-crystallographic symmetry elements: The program Find2Folds. Acta Crystallogr. A58, 
C76). Further analysis revealed another site per subunit (Cys301) with less occupancy and the site branched in two 
positions with about 2.4A distance. Subsequently the phases were improved by application of twofold averaging com- 
bined with solvent flattening and histogram matching as implemented in DM. The initial electron density at 2.6 A res- 
olution was readily interpretable and about 90% of the polypeptide chain could be built. The molecular model was 
refined against 2.3 A data. Subsequent rounds of manual rebuilding and refinement with REFMAC (Murshudov, G.N., 
Vagin, A.A., Lebedev, A., Wilson, K.S. & Dodson, E.J. (1 999). Efficient anisotropic refinement of macromolecular struc- 
tures using FFT. Acta Crystallogr. D 55, 247-255) led to a complete molecular structure of the polypeptide chain from 
residues Ser39 to Pro766. Details of the refined structures are reported in Table 3. Coordinates have been deposited 
in the Protein Data Bank PDB. 

Overall structure 

55 

[0073] The structure of human DPP-IV was solved by multiple anomalous dispersion (MAD) using a mercury deriv- 
ative (see Table 3) and subsequently refined to an R-factor of 21 .5 % at 2.1 A resolution. The current model consists 
of all residues from Ser39 to Pro766 of the amino acid sequence of the expressed ectodomain of the protein. 
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[0074] A homodimer of DPP-IV is situated in the asymmetric unit (Figure 2). Dimerization is also observed in solution 
under various conditions and is required for activity. Each subunit is made of two domains, the catalytic domain with 
an a/p hydrolase fold containing the catalytic triad (Ser630, Asp708, His740) and a domain with an eight-bladed p- 
propellerfold, the p-propeller domain (Figure 2). The assignment of the secondary structure is given in Figures 1 and 
s 2. The only other known crystal structure of this class of enzyme is prolyl-oligopeptidase (POP) determined by Fiilop 
(Fiilop, V., Bocskei, Z. & Polgar, L. (1998). Prolyl oligopeptidase: an unusual beta-propeller domain regulates proteol- 
ysis. Ce//94, 161-170; pdb entry 1qfm) POP also has an o/p-hydrolase and a p-propeller domain, but is monomeric 
and the p-propeller consists of seven repeats only (Figure 3C). 

io Catalytic Domain 

[0075] The catalytic domain is built up of residues Gln508 to Pro766 and contains a central eight-stranded parallel 
P-sheet that is flanked by 12 helices known as ot/p hydrolase fold. 21% sequence identity to POP indicates significant 
structural homology (Figure 1) and superposition of the central a-helix, carrying the catalytic Ser630 on its first turn, 
is with the corresponding structure of POP gives an r.m.s deviation of 2.5 A 2 for 238 residues. The catalytic domain is 
connected to the p-propeller by an N-terminal 1 5 residue linker, which is considerably shorter than the corresponding 
76 residue region in POP. The residues lacking in DPP-IV are, however, replaced structurally and functionally by the 
C-terminal part of the catalytic domain of the second subunit of the dimer. 

20 p-propeiler domain 

[0076] The p-propeller domain is formed by the residues Lys56 to Asn497. The preceding N-terminal residues Ser39 
to Leu55 form a loop structure with a small a-helix (a1*, Figure 1) at the surface and in close proximity to the first 
residues of the catalytic domain. The p-propeller domain consists of an eight-fold repeat of a four-stranded antiparallel 
25 p-sheet motif (blade, Figure 3). The blades are in circular arrangement such that they form a solvent filled tunnel with 
a diameter of about 1 3 A. 

[0077] The p-propeller domain in DPP-IV does not form a joint p-sheet motif (described as molecular "velcro"; Fiilop, 
V. & Jones, D.T (1 999). Beta propellers: structural rigidity and functional diversity. Curr. Opin. Struct. Biol. 9, 715-721 ; 
Paoli, M. (2001). Protein folds propelled by diversity. Prog. Biophys. Mol.Biol.76, 103-130), but rather the blades show 

30 a regular arrangement (pi/1 to P7/4 or p8/4) (Figure 3A) around the central axis forming a ring system that is not closed. 
[0078] DPP-IV deviates from the regular p-propeller fold by additional secondary structural elements. An anti-parallel 
P-sheet is inserted in blade two between the strands one and two. The tip of the turn carries the residues Arg125 that 
forms a salt bridge with Glu205, that is situated at the C-terminal turn of an a-helix (residues Trp154 to Thr1 99), that 
is inserted between the first and second strands of blade 4. Arg125, Glu205 and the neighboring Glu204 form a sig- 

35 nificant part of the substrate binding site and are mainly responsible for the substrate specificity. An further anti-parallel 
P-sheet motif formed by residues Asp230 to Asn263 is inserted between the strands three and four of blade four (Figure 
3B). This structural element forms a significant part of the dimer interface (see below). 

[0079] Whereas the N-terminal p-sheet structure of the propeller has shorter strands and is somewhat tilted, the loop 
connecting the first and second p-sheet is longer, shows high temperature factors and may reduce the rigidity of the 
40 propeller architecture. The reduced stability of the circular domain structure at this position might be compensated by 
an extended hydrophobic cluster that consists of Ile63, Leu69, Ile76, Phe89, Leu90, Phe95, Phe98, Ile107, Ile114, 
Tyr135, Leu 137 and Leu 142, and a salt bridge between Arg61 and Asp 104 and a hydrogen bond between the main 
chain NH of Arg61 and Tyr1 05. This distortion leads to a reduced height of the propeller at the positions between blade 
one and two (Figure 3B). 

45 [0080] As no residues from the a/p hydrolase domain fill this up, a cleft between the two domains of the DPP-IV 
molecule is formed with a diameter of about 1 5 A enabling access to the catalytic site (Figure 4). Therefore, we propose 
that DPP-IV has two independent ways for the substrate and product to access and leave the active site, a cleft between 
the domains and the tunnel through the p-propeller. The open cleft may enable large peptides and partially folded 
proteins to access the active site. The more narrow tunnel could be an exit for the cleaved dipeptides (Figure 4). The 

so crystal structure of POP shows that the cleft between the two domains does not exist and the tunnel through the p- 
propeller is more narrow with about 4 A compared to about 1 3 A for DPP-IV (Figure 3A and 3C). This structural difference 
is supported by the observation that DPP-IV can process much larger substrates compared to POP. Peptides with a 
length of up to about 80 residues appear to be good substrates of DPP-IV. Larger proteins may also be cleaved de- 
pending on theirtertiary structure. POP is reported to hydrolyse substrates with a maximum size of about 30 residues, 

55 only (Polgar, L. (1 992). Unusual secondary specificity of prolyl oligopeptidase and the different reactivities of its two 
forms toward charged substrates. Biochemistry 31 , 7729-7735.). As the diameter of the p-propeller tunnel in POP is 
significantly smaller, it is conceivable that the structure of DPP-IV represents a more open and active enzyme. 
[0081] The p-propeller motif has been found in several further proteins, but no or only low sequence homology could 
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be demonstrated (Polgar, L. (1 992). Unusual secondary specificity of prolyl oligopeptidase and the different reactivities 
of its two forms toward charged substrates. Biochemistry 31 , 7729-7735.). A search of the PDB for homologous struc- 
tures gave the best results for clathrin (7 blades, ter Haar, E., Musacchio, A., Harrison, S.C. & Kirchhausen, T. (1 998). 
Atomic structure of clathrin: a beta propeller terminal domain joins an alpha zigzag linker. Cell 95, 563-573), methyl- 
amine dehydrogenase (7 blades, Chen, L, Doi, M., Durley, R.C., Chistoserdov, A.Y., Lidstrom, M.E., Davidson, V.L. & 
Mathews, F.S. (1998). Refined crystal structure of methylamine dehydrogenase from Paracoccus denitrificans at 1 .75 
A resolution. J. Mol. Biol. 276, 131-149) and nitrite reductase (8 blades, Nurizzo, D., Cutruzzola, F., Arese, M., Bour- 
geois, D., Brunori, M., Cambillau, C. & Tegoni, M. (1998). Conformational changes occurring upon reduction and NO 
binding in nitrite reductase from Pseudomonas aeruginosa. Biochemistry 37 ', 13987-13996), but no DPP-IV related 
function can be expected. 

Active site 

[0082] The catalytic triad (Ser630, Asp708, His740) is located in a large cavity at the interface of the two domains. 
Ser630 is found at the tip of a very sharp turn between (J-strand 5 and helix C, called the nucleophile elbow, which is 
a characteristic of hydrolases of the ot/p type (Ollis, D.L., Cheah, E., Cygler, M., Dijkstra, B., Frolow, F., Franken, S.M., 
Harel.M., Remington, S.J., Silman, I., Schrag, J. &etal. (1 992). The alpha/beta hydrolasefold. Protein Eng.5, 197-211). 
The serine hydroxy group is well exposed to solvent and hydrogen bonded to the catalytic imidazole group of His740 
on one side (2.6 A) and accessible to the substrate on the other side. His740 is found in the middle of a loop between 
p-strand 8 and helix F. With a distance of 2.75 A to Ne of the imidazole ring, one of the oxygen atoms of Asp708 is 
hydrogen bonded to His740 and completes the catalytic triad (Figure 5). The other oxygen atom of the carboxylate 
group of Asp708 is coordinated by two main chain NH-groups (Val711 and Asn71 0). Thus, the location and geometry 
of the triad are very similar to that found in other a/p* hydrolases with the "handedness" opposite to the classical serine 
peptidases. 

[0083] The negatively charged oxyanion of the tetrahedral intermediate is stabilized by the main chain NH-group of 
Tyr631 and by the hydroxy group of Tyr547 (Figure 5). Furthermore, the structure shows that the two Gly628 and 
Gly632 are important for the formation of the sharp turn to bring the catalytic residue Ser630 in the correct position. 
This is in accordance with mutagenesis studies on rat DPP-IV (Ogata, S., Misumi, Y, Tsuji, E., Takami, N., Oda, K. & 
Ikehara, Y. (1992). Identification of the active site residues in dipeptidyl peptidase IV by affinity labeling and site-directed 
mutagenesis. Biochemistry 31 , 2582-2587) showing that the sequence Gly 628 -X-Ser 630 -Tyr 6 3 1 -Gly 6 32 is essential for 
DPP-IV activity. 

Substrate binding 

35 [0084] The substrate binding site of DPP-IV is indicated by the inhibitor Diprotin-A (lle-Pro-lle). It is a slowly hydro- 
lysable substrate with k cat /K M a factor of 10 less than lle-Pro-4-nitroanilides (Rahfeld, J., Schierhorn, M., Hartrodt, B., 
Neubert, K. & Heins, J. (1991). Arediprotin A (lle-Pro-lle) and diprotin B (Val-Pro-Leu) inhibitors or substrates of dipep- 
tidyl peptidase IV? Biochim. Biophys. Acta 1076, 314-316). Inspection of the electron density map shows the ligand 
covalently bound to the active site Ser630 of the enzyme in both subunits. The N-terminal lie (P2) and Pro residues 

40 (P1) are well defined and enable a detailed analysis of the interaction with the substrate binding site (according to the 
notation of Schechter; Schechter, I. & Berger, A. (1968). On the active site of proteases. 3. Mapping the active site of 
papain; specific peptide inhibitors of papain. Biochem. Biophys. Res. Commun. 32, 898-902). Less well defined electron 
density is found for the C-terminal lie (P1 '), but in subunit B the conformation of this part of the ligand could also be 
observed (Figure 5). The side chain Ne of the catalytic His740 is in hydrogen bonding distance to the NH-group of P1' 

45 (2.90 A) and to the Oy of the Ser630 side chain (2.74 A). 

[0085] DPP-IV hydrolyzes oligopeptides and proteins from the N-terminus, cleaving dipeptide units when the second 
residue is proline, hydroxyproline, dehydroproline, pipecolic acid or alanine. In both subunits the proline in position P1 
of Diprotin-A is in the trans-configuration and fits optimally into the pocket of the active site as expected (Fischer, G., 
Heins, J. & Barth, A. (1983). The conformation around the peptide bond between the P1- and P2-positions is important 

so for catalytic activity of some proline-specific proteases. Biochim. Biophys. Acta 742, 452-462). The S1 pocket is formed 
by Val711, Val656, Tyr662, Tyr666, Tyr659 andTyr631 which shape a well defined hydrophobic pocket that would be 
filled by proline much betterthan by alanine. Gly is also accepted, but with very low k ca /K M values (Brandt, W. , Lehmann, 
T, Thondorf, I., Bom, I., Schutkowski, M., Rahfeld, J.U., Neubert, K. & Barth, A. (1995). A model of the active site of 
dipeptidyl peptidase IV predicted by comparative molecular field analysis and molecular modelling simulations. Int. J. 

55 Pept. Protein Res. 46, 494-507). All other naturally ocurring amino acids residues cannot occupy position P1 . Either 
the side chains are too bulky or hydrophilic. The side chains of the residues P2 and P1* point into the solvent and no 
interaction with the protein occurs. This explains the large diversity of amino acids accepted in substrates at these 
positions. 



12 



EP 1 422 293 A1 



[0086] Essential for substrate binding and catalysis is the N-terminus of the substrates, which has to be unprotected 
and protonated (Brandt, W., Ludwig, O., Thondorf, I. & Barth, A. (1 996). A new mechanism in serine proteases catalysis 
exhibited by dipeptidyl peptidase IV (DP IV) - Results of PM3 semiempirical thermodynamic studies supported by 
experimental results. Eur. J. Biochem. 236, 1 09-1 14). The Diprotin-A complex shows that the terminal -NH 3 + -group is 
held very precisely in position by strong interactions with the carboxylates of Glu205 and Glu206 (Figure 5). A third 
glulamate, Glu204, stabilizes this substrate recognition site by an hydrogen bonding network with the backbone NH 
of Arg125, His126 and Ser127 as well as the hydroxy group of Ser127. Importance of the glutamate residues is con- 
firmed by single point mutations that abolish DPP-IV activity (Abbott, C.A., McCaughan, G.W. & Gorrell, M.D. (1 999). 
Two highly conserved glutamic acid residues in the predicted beta propeller domain of dipeptidyl peptidase IV are 
required for its enzyme activity. FEBS Lett. 458, 278-284). The double Glu-motif is located at the end of an helical 
segment (o2* in Figure 1 , see also Figure 3) that is highly conserved in the DPP IV-like gene family (Asp-Trp-X-Tyr- 
G!u-Glu-Glu-X). The helix represents a deviation from the regular (3-sheet architecture of the p-propeller domain (Fig- 
ures 1 and 3A). The superposition of the active sites of the exopeptidase DPP-IV complexed with Diprotin A and the 
endopeptidase POP complexed with an octapeptide (Fiilop, V, Szeltner, Z„ Renner, V. & Polgar, L. (2001). Structures 
of prolyl oligopeptidase substrate/inhibitor complexes. Use of inhibitor binding for titration of the catalytic histidine 
residue. J. Biol. Chem. 276, 1262-1266) shows clear differences. The octapeptide substrate of POP coincides with the 
double Glu-motif in DPP-IV indicating that this additional structural element functions is very important for substrate 
selection. Thus, the double Glu-motif is a recognition site for the N-terminus of substrates and restricts the cleavage 
to dipeptides and the S1 pocket provides an optimal binding to proline and alanine residues leading to a highly specific 
peptidase. 

Mode of inhibition by Diprotin-A 

[0087] Inspection of the electron density of the bound inhibitor shows a covalent linkage to Ser630 and a spa-con- 
figuration for the C-atom of the former carbonyl-group of the scissile peptide. Consequently, a tetrahedral intermediate 
is observed in the complex structure with the substrate Diprotin A (Figure 5) with the oxyanion stabilized by hydrogen 
bonds to the hydroxy group of the side chain of Tyr547 (2.80 A) and the main chain amine of Tyr631 (3.38 A). As much 
catalytic power of serine proteases derives from its preferential binding of this transition state, the tetrahedral interme- 
diate is a well-defined but high energy state with a short lifetime and its accumulation must be a result of a kinetic barrier. 
[0088] Inspection of the active site structure reveais several structural features that are special to Diprotin A and 
may lead to the competitive inhibition of this substrate. First, the two hydrophobic isoleucine side chains point into the 
same direction in proximity and, therefore, this hydrophobic interaction may stabilize the tripeptide in a unsuitable 
conformation for the progress of the reaction. Second, a large network of salt bridges and hydrogen bonds stabilize 
the complex. It involves the carboxyl groups of Glu205/206 that interact with the N-terminus of the tripeptide, but Glu205 
makes another salt bridge to Arg1 25 and this in turn interacts with the C-terminal carboxyl group of the tripeptide (Figure 
5). It is obvious that this interaction is only present in tripeptidic substrates and may stabilize the observed intermediate 
by protection of the leaving group. 

Dimcrization 

[0089] The crystal structure as well as analytical ultracentrifugation indicate dimeric oligomerization for deglycosylat- 
ed sDPP-IV with a molecular weight of 169 kDa and non-crystallographic twofold symmetry (Figure 2). Six percent or 
1837 A 2 of the total solvent accessible surface area of each subunit is buried in the dimer interface (program XSAE, 
Broger, C. personal communication). This interface is mainly build up by two extra p-strands (p1* and p2") in the loop 
between the strands two and three of the fourth blade of the p-propeller domain (Figure 3A and 3B). Further interaction 
is provided by the a/p hydrolase domain with helix aE, p-strand p8 and helix aF with mainly hydrophobic interactions. 
The active site is very close to this dimer interface (Figure 2) with His740 from the catalytic triad located in the loop 
connecting aF and p7 (Figure 1 ). Consequently disruption of the dimer interface would also strongly affect the catalytic 
activity and dimerization is required for activity. 

Stability of DPP-IV 

[0090] As a cell surface protein DPP-IV is extremely stable. Consequently the recombinant sDPP-IV shows a half 
life of 5 min at 71 °C in irreversible heat inactivation experiments independent of the protein concentration and the 
degree of glycosylation indicating high thermal stability. In unfolding experiments (Lambeir, A.M., Diaz Pereira, J.F., 
Chacon, P., Vermeulen, G., Heremans, K., Devreese, B., Van Beeumen, J., De Meester, I. & Scharpe, S. (1997). A 
prediction of DPP IV/CD26 domain structure from a physicochemical investigation of dipeptidyl peptidase IV (CD26) 
from human seminal plasma. Biochim. Biophys. Acta 1340, 215-2) with protein purified from human seminal plasma 
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DPP-IV retained its native conformation up to 8 M Urea. 

[0091] The crystal structure points to several factors that may contribute to this stability. Firstly, the structural organ- 
ization as a dimer with an extended hydrophobic interface stabilizes the molecule as shown for several other proteins 
(Thoma, R., Hennig, M., Sterner, R. & Kirschner, K. (2000). Structure and function of mutationally generated monomers 

5 of dimeric phosphoribosylanthranilate isomerase from Thermotoga maritima. Structure Fold. Des. 8, 265-276). Sec- 
ondly, we observe live disulphide bonds and two free sulfhydryl groups by SH titration experiments under denaturing 
conditions that are now confirmed by the X-ray structure. All disulphide bridges in the |3-propeller connect different 
strands in blades orstabilize loops (Cys444/Cys447; Cys385/Cys394, Cys454/Cys472, Cys328/Cys339). One disulfide 
bond is observed in the (ct/p-hydrolase domain (Cys649/Cys762) and covalently links the C-terminal helix ocF to the 

10 core of the cc/p hydrolase domain. 

Glycosylation 

[0092] sDPP-IV overexpressed in P. pastoris shows a decreasing molecular weight over the elution peak in the 
15 analytical gelfiltration as analyzed online with a multiangle laser light scattering detector. In contrast, sDPP-IV degly- 
cosylated with EndoF glycosidase shows an uniform molecular weight over the whole peak range, because of the 
specific cleavage of asparagine linked oligomannose after the first N-acetylglucoamines residue (GlcNAc). This leads 
to a decrease in molecular weight of 20 kDa as estimated by SDS-PAGE. Crystals suitable for X-ray diffraction are 
only observed for deglycosylated sDPP-l V and structure analysis shows four GlcNAc with interpretable electron density 
20 at the positions N85, N150, N229 and N281 in subunit A. In subunit B, again N85, N150 and N229 are visible, but no 
electron density was found for N281 and an additional site could be identified at N92. The GlcNAc of N85 is involved 
in a crystal contact in both subunits. 

[0093] DPP-IV expressed in human has a more complex type of glycosylation compared to P. pastoris (Cremata, J., 
Montensino, R., Quintero, O. & Garcia, R. (1998). Glycosylation Profiling of Heterologous Proteins. In Pichia Protocol 
25 (Higgins, D.R. & Cregg, J.M., eds.), vol. 103. pp. 95-106, Humana Press: Totowa, New Jersey) and contains terminal 
sialic acid, however, this seems not to be a requirement for correct folding as shown here. 

Interaction with ADA 

30 [0094] Adenosine deaminase (ADA; EC 3.5.4.4) is a41 kDa protein expressed in all mammaliantissues that catalyzes 
the deamidation of adenosine and 2"-deoxyadenosine to inosine and 2'-deoxyinosine, respectively. It is important for 
the regulation of the extracellular concentration of adenosine and for the regulation of the immune response. ADA is 
involved in T cell activation in general and the pathogensis of autoimmune disorders (such as rheumatoid arthritis) as 
well as the mechanism of immunodeficiency disease (such as SCID or AIDS). Binding of the soluble extracellular ADA 

35 is a unique property of DPP-IV molecules of higher mammals and is not observed in mouse nor rat DPP-IV (Iwaki- 
Egawa, S., Watanabe, Y. & Fujimoto, Y. (1997). CD26/dipeptidyl peptidase IV does not work as an adenosine deami- 
nase-binding protein in rat cells. Cell Immunol. 178, 180-186). Using analytical ultra-centrifugation, we observe a 1:1 
complex of a ADA molecules with a sDPP-l V subunit giving a molecular weight of 252 kDa. Surface plasmon resonance 
(Biacore) measurements show a binding constant of 3.15 ± 2 nM to ADA from bovine with a very low dissociation rate 

40 (k otf =8.75*10- 5 s-1, ko^.gS'lO 4 M" 1 s- 1 ) indicating a strong interaction. 

[0095] Mutagenesis studies (Abbott, C.A., McCaughan, G.W., Levy, M.T., Church, W.B. & Gorrell, M.D. (1 999). Bind- 
ing to human dipeptidyl peptidase IV by adenosine deaminase and antibodies that inhibit ligand binding involves over- 
lapping, discontinuous sites on a predicted beta propeller domain. Eur. J. Biochem. 268, 798-81 0; Dong, R.P, Tachiba- 
na, K., Hegen, M., Munakata, Y, Cho, D., Schlossman, S.F & Morimoto, C. (1997). Determination of adenosine deam- 

45 inase binding domain on CD26 and its immunoregulatory effect on T cell activation. J. Immunol. 159, 6070-6076) 
identified two important regions in DPP-IV LeUa^-Val^-Ala^-Arg^ (at the beginning of (55/4) and Leu294 (a4, at 
the end of blade 4) and a less important region Glu332-Ser 3 33-Ser33 4 -Gly33 5 -Arg33 6 (loop region, at the end of 05/3) 
that are all located at the surface of the p-propeller domain (Figure 1). Mutation to amino acids found in rat DPP-IV 
reduces binding affinity to ADA. These residues form a binding site that is located far away from the active site (Figure 

so 2) confirming the independence of DPP-IV activity on ADA binding (Table 1 ; De Meester, I., Vanham, G., Kestens, L., 
Vanhoof , G., Bosmans, E., Gigase, P. & Scharpe, S. (1 994). Bindjng of adenosine deaminase to the lymphocyte surface 
via CD26. Eur. J. Immunol. 24, 566-570). It is concluded that the function of DPP-IV is the localization and orientation 
of ADA for proper catalysis. The structure gives an indication for the orientation and localization at the cell surface, 
because the N-terminus must be close to the membrane and the ADA binding would be on the opposite site of the 

55 molecule - pointing away from the cell surface (Figure 2). Further, there would be sufficient space enabling interaction 
of ADA to the A1-adenosine receptor (Ciruela, F, Saura, C, Canela, E.I., Mallol, J., Lluis, C. & Franco, R. (1996). 
Adenosine deaminase affects ligand-induced signaling by interacting with cell surface adenosine receptors. FEBS Left. 
380, 219-223) which probably plays an important role in the ontogenesis of immune tissues. This view would also 



BNSDCCID: <EP_ 



EP 1 422 293 A1 



support the hypothesis proposing a link for cell-cell interaction via the binding of DPP-IV, ADA and A1 -adenosine. 
Biological Implications 

[0096] The crystal structure of DPP-IV at 2.1 A resolution reveals a V-shaped dimeric molecule with an extended 
dimer interface fostering the conformation of the overall molecule. The membrane association and stability of DPP-IV 
is used for binding of other proteins like ADA in order to achieve localization without disturbance of the enzymatic- 
functionality. 

[0097] Analysis of the complex with Diprotin A shows key structural features for proline specific exopeptidase spe- 
cificity and activity. The negative charge of the double Glu motif guides the N-terminus of the peptide to the active site 
and fixes the substrate in the correct position for cleavage. The distance between this motif and the catalytic Ser630 
limits the cleavage to dipeptides and the S1 pocket can just adopt proline or with less affinity alanine as side chains. 
[0098] The low turnover rate of Diprotin A may be explained by the hydrophobic interaction of the two lie-residues 
in the P2 and P1" positions as well as an extensive salt bridge cluster that involves the negatively charged C-terminus 
of Diprotin A. This structural information will aid the design of new specific inhibitors. 

[0099] The active site is very accessible to the solvent by two entrances explaining that peptides can be cleaved by 
DPP-IV with almost no size limitation. A second access to the active site by the tunnel of the p-propeller domain is 
large enough to enable the release of the cleaved dipeptides. This structural arrangement certainly improves the cat- 
alytic turnover and is in great contrast to the crystal structure of POP that shows a much more narrow tunnel and no 
further access to the active site. 

[0100] For most of the special features of DPP-IV namely dimerization, regulation of substrate access via two en- 
trances, recognition of the substrate (double Glu-motif) and interaction with other proteins like ADA the ^-propeller 
domain plays a key role. Thus, DPP-IV is an excellent example that the p-propeller fold can be tailored to adapt to 
different functionality. 



Table 1 . 



Enzyme Kinetic Constants of DPP-IV 


proteins 


k cat* 


K M * 




(s-1) 


u-M) 




sDPP-IV deglycos 


43.1 


17.2 


2.51 


sDPP-IV glycos 


37.3 


15.5 


2.41 


sDPP-IV deglycos ./ADA 


39.6 


14.8 


2.68 



• analyzed using Line weave r-Burk plots; buffer: 50 mM Tris/HCI pH 7.8, containing 100 mM NaCI, 0.1 mg/ml BSA and 0.5% Dimethyl-formamid; 
tcmDcmtum- tt°Cl 



Table 2. 



K, and Kq Values of DPP-IV Inhibitors 




K i 


K D 


kon 


ko« 


uM) 


uM) 


rVHs-1 


S" 1 


lle-Pro-lle 


4.63* 


3.8t 






NVP-DPP728 


0.006* 


0.002t 


1.36-10 6 T 


2.48*1 0"3t 


NVP-DPP728 (Ljt) 


0.011 


0.010 


1.3*105 


1.3*10-3 



r measured with biacore; buffer: 0.01 M Hepes, pH 7.4, containing 0.15 M NaCI, 3 mM EDTA, 0.005% polysorbate 20 (v/v) 
temperature: 25°C; in assay buffer (see Table 1); glycosylated sDPP-IV 



* Hughes, T.E., Mone, M.D., Russell, M.E., Weldon, S.C. & Villhauer, E.B. (1999). NVP-DPP728 (1-[[[2-t(5-cyanopyridin-2-yl)amino]ethyl]aminol 
acetyl|-2-cyano-(S)-pyrrolidine). a slow-binding inhibitor of dipeptidyl peptidase IV. Biochemistry 38, 11597-11603 
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Crystallographic Data and Refinement Statistics 


Data set 


MAD Remote 


MAD Peak 


MAD Inflection 


Apo 


Diprotin-A 












complex 


Wavelength 


0.992 


1 .0065 


• 1 .009 


0.9765 


0.92 


X-ray source 


SLS 


SLS 


SLS 


ID14.ESRF 


SLS 


Detector 


MAR IP a 


MAR IP a 


. MAR IF>a 


Quantum CCD 


MAD ("("Pi 


Exposure time/ 


10 


1 0 




2 


4 


frame (s) 














2 0 


2 0 


' 




0.25 


increment per 












frame (°) 












total rotation 


110 


136 


140 


130 


130 


range (°) 












crystal to 


410 


410 


410 


240 


260 


detector 












distance (mm: 












unit cell 


65.2; 68,7; 420.1 


65.2; 68.7; 420.1 


65.2; 68.7; 


65.5; 68.2; 419.3 


65.1; 67.1; 41 9.6 


parameters a, 






420.1 






b,c (A) 












data reduction 


Maximum 


2.6 


2.6 


2.6 


2.1 


2.5 


Resolution (A) 












No. of 


212619 


263910 


276921 


234528 


171090 


measurements 












No. of unique 


58627 


59544 


59939 


87 113 


64208, 


reflections 












completeness 


97.5 (99,4) 


99.9 (100.0) 


99.9 (99.9) 


82.9 (72.3) 


97.5 (99.4) 


(%)* 












Rsym *, b * 


9.1 (15.9) 


9.0 (18.1) 


8.6 (14.2) 


8.4 (26.8) 


9.1(15.9) 


heavy-atom refinement paramet 


f(e)/f(e) 


-7.0/9.5 


-8.0/9.8 


-12.1/5.0 






Phasing 


0.95 


1.0 


0.7 






power 0 












(anomalous) 












Refinement statistics 


resolution 








20-2.1 


30-2.5 


range (A) 












°cryst (Rfree) d 








21 .5(26.5) 


22.5(28.2) 


(%) 












No. of protein 








11 962 


11 962 (27.1) 


atoms e (mean 
B 












in A 2 ) 








(34.6) 





a Marresearch image plate detector, diameter 345mm, 100jim pixel size 
* Values in parentheses are statistics for highest resolution bin. 
b R sym = -h-i ll i( h ) _<, ( n ) >l/ lhli< h )' where lj(h) und <l(h)> are the ith and 
c Phasing power ^hPHth^lFr^h)- IF N (h) + F H (h)ll. 

d *t, IIF obs' " l F cate"-h IF obs 1 ' where ""obs 1 and IF calc' 
the working (R cryst ) and test (R fres )sets, respectively. 
e Non-hydrogen atoms, only. 



;n measurement of the intensity of reflection h. 



observed and calculated structure fe 
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Table 3. (continued) 



Crystallographic Data and Refinement Statistics 


Data set 


MAD Remote 


MAD Peak 


MAD Inflection 


Apo 


Diprotin-A 
complex 


Refinement stati 


sties 




No. of water 
molecules 
No. of ligand/ 
heavy atoms 
(mean B in A*) 

No. of NAG 
atoms (mean B 
in A2) 

rmsd f bonds 
(A 2 ) 

Rmsd f angles 
(°) 








6 (77.3) . 

112(59.0) 

0.018 
1.86 


268(25.0) 
24 (28.3) 

98 (51.4) 

0.019 
2.07 



rmsd: root mean square deviation from mean. 



25 



30 



35 



40 



45 



50 



55 
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REFINEMENT REMARKS: 



2 . 1A resolution 



Table 4: Structure coordinates for human DPP-IV 

Table 4 lists the atomic structure coordinates for DPP-IV as derived by X-ray diffraction 
from a crystal of DPP-IV. 

HEADER 
COMPND 
COMPND 
SOURCE h 
REMARK 1 
REMARK 1 
REMARK 1 
REMARK 1 
REMARK 1 
REMARK 1 

refinement) 
REMARK 1 
REMARK . 1 
REMARK 2 
REMARK 2 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 



3 REFINEMENT. 

3 PROGRAM 

3 AUTHORS 
3 

3 REFINEMENT TARGET 



REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 



DATA USED IN REFINEMENT. 
RESOLUTION RANGE HIGH (ANGSTROMS) 
RESOLUTION RANGE LOW (ANGSTROMS) 
DATA CUTOFF (SIGMA (F) ) 

COMPLETENESS FOR RANGE (%) 
NUMBER OF REFLECTIONS 

FIT TO DATA USED IN REFINEMENT. 
CROSS-VALIDATION METHOD 
FREE R VALUE TEST SET SELECTION 
R VALUE (WORKING + TEST SET) 

R VALUE (WORKING SET) 

FREE R VALUE 

FREE R VALUE TEST SET SIZE (%) 
FREE R VALUE TEST SET COUNT 

FIT IN THE HIGHEST RESOLUTION BIN. 
TOTAL NUMBER OF BINS USED 
BIN RESOLUTION RANGE HIGH 
BIN RESOLUTION RANGE LOW 
REFLECTION IN BIN (WORKING SET) 

BIN R VALUE (WORKING SET) 

BIN FREE R VALUE SET COUNT 
BIN FREE R VALUE 



12 .00 
NONE 
82 .99 
87113 



THROUGHOUT 
RANDOM 
0.21747 
0.21485 
0.26560 
5.0 
4619 



2.100 
2 .153 
2014 
0.246 



ESTIMATED OVERALL COORDINATE ERROR. 
ESU BASED ON R VALUE 
ESU BASED ON FREE R VALUE 
ESU BASED ON MAXIMUM LIKELIHOOD 

ESU FOR B VALUES BASED ON MAXIMUM LIKELIHOOD (A*' 



RMS DEVIATIONS FROM IDEAL VALUES COUNT RMS 

BOND LENGTHS REFINED ATOMS (A): 12400 ; 0.018 

BOND LENGTHS OTHERS (A): 10588 ; 0.001 

BOND ANGLES REFINED ATOMS (DEGREES): 16B76 ; 1.867 

BOND ANGLES OTHERS (DEGREES): 24632 ; 0.889 

TORSION ANGLES, PERIOD 1 (DEGREES): 1454 ; 5.183 

TORSION ANGLES, PERIOD 3 (DEGREES): 2075 ,19.350 

CHIRAL-CENTER RESTRAINTS (A**3): 1790 ; 0.135 

GENERAL PLANES REFINED ATOMS (A): 13738 ; 0.007 

GENERAL PLANES OTHERS (A): 267 4 ; 0.004 

NON-BONDED CONTACTS REFINED ATOMS (A): 2592 ; 0.240 

NON-BONDED CONTACTS OTHERS (A): 10721 ; 0.223 



0.280 
0.228 
0.244 
9.427 



WEIGHT 
0.021 
0.020 
1.936 
3 .000 
3 .000 
15 .000 
0.200 
0.020 
0.020 
0.300 
0.300 
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REMARK 
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NON- BONDED TORSION OTHERS 
H-BOND (X. . . Y) REFINED ATOMS 
H-BOND (X...Y) OTHERS 
SYMMETRY VDW REFINED ATOMS 
SYMMETRY VDW OTHERS 
SYMMETRY H-BOND REFINED ATOMS 



(A) : 



ISOTROPIC THERMAL FACTOR RESTRAINTS. 
MAIN-CHAIN BOND REFINED ATOMS <A**2) : 
MAIN-CHAIN ANGLE REFINED ATOMS (A**2) : 
SIDE-CHAIN BOND REFINED ATOMS (A**2) : 
SIDE-CHAIN ANGLE REFINED ATOMS (A**2) : 



17 ; 0.494 

820 ; 0.155 

7 ; 0.115 

9 ; 0.235 

38 ; 0.277 

3 ; 0.397 

COUNT RMS 

7252 ; 0.874 

11766 ; 1.603 

5148 ; 2.300 

5110 ; 3.638 
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data collected at 100K at ID14 in Grenoble (ESRF, France) 
Phasing by MAD using Hg derivative and data collected to 2.7 A 
at Villigen (SLS, Switzerland) 

SER ARG LYS THR TYR THR LEU THR ASP TYR LEU LYS ASN 
THR TYR ARG LEU LYS LEU TYR SER LEU ARG TRP ILE SER 
ASP HIS GLU TYR LEU TYR LYS GLN GLU ASN ASN ILE LEU 
VAL PHE ASN ALA GLU TYR GLY ASN SER SER VAL PHE LEU 
GLU ASN SER THR PHE ASP GLU PHE GLY HIS SER ILE ASN 
ASP TYR SER ILE SER PRO ASP GLY GLN PHE ILE LEU LEU 
GLU TYR ASN TYR VAL LYS GLN TRP ARG HIS SER TYR THR 
ALA SER TYR ASP ILE TYR ASP LEU ASN LYS ARG GLN LEU 
ILE THR GLU GLU ARG ILE PRO ASN ASN THR GLN TRP VAL 
THR TRP SER PRO VAL GLY HIS LYS LEU ALA TYR VAL TRP 
ASN ASN ASP ILE TYR VAL LYS ILE GLU PRO ASN LEU PRO 
SER TYR ARG ILE THR TRP THR GLY LYS GLU ASP ILE ILE 
TYR ASN GLY ILE THR ASP TRP VAL TYR GLU GLU GLU VAL 
PHE SER ALA TYR SER ALA LEU TRP TRP SER PRO ASN GLY 
THR PHE LEU ALA TYR ALA GLN PHE ASN ASP THR GLU VAL 
PRO LEU ILE GLU TYR SER PHE TYR SER ASP GLU SER LEU 
GLN TYR PRO LYS THR VAL ARG VAL PRO TYR PRO LYS ALA 
GLY ALA VAL ASN PRO THR VAL LYS PHE PHE VAL VAL ASN 
THR ASP SER LEU SER SER VAL THR ASN ALA THR SER ILE 
GLN ILE THR ALA PRO ALA SER MET LEU ILE GLY ASP HIS 
TYR LEU CYS ASP VAL THR TRP ALA THR GLN GLU ARG ILE 
SER LEU GLN TRP LEU ARG ARG ILE GLN ASN TYR SER VAL 
MET ASP ILE CYS ASP TYR ASP GLU SER SER GLY ARG TRP 
ASN CYS LEU VAL ALA ARG GLN HIS ILE GLU MET SER THR 
THR GLY TRP VAL GLY ARG PHE ARG PRO SER GLU PRO HIS 
PHE THR LEU ASP GLY ASN SER PHE TYR LYS ILE ILE SER 
ASN GLU GLU GLY TYR ARG HIS ILE CYS TYR PHE GLN ILE 
ASP LYS LYS ASP CYS THR PHE ILE THR LYS GLY THR TRP 
GLU VAL ILE GLY ILE GLU ALA LEU THR SER ASP TYR LEU 
TYR TYR ILE SER ASN GLU TYR LYS GLY MET PRO GLY GLY 
ARG ASN LEU TYR LYS ILE GLN LEU SER ASP TYR THR LYS 
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VAL ASP PHE GLN ALA MET TRP TYR THR ASP GLU ASP HIS 
GLY ILE ALA SER SER THR ALA HIS GLN HIS ILE TYR THR 
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OD1 ASP A 104 
OD2 ASP A 104 
N TYR A 105 
CA TYR A 105 
C TYR A 105 
O TYR A 105 
CB . TYR A 105 
CG TYR A 105 
CD1 TYR A 105 
CD2 TYR A 105 
CE1 TYR A 105 
CE2 TYR A 105 
CZ TYR A 105 
OH TYR A 10S 
N SER A 106 
CA SER A 106 
C SER A 106 
O SER A 106 
CB SER A 106 
OG SER A 106 
N ILE A 107 
CA ILE A 107 
C ILE A 107 
O ILE A 107 
CB ILE A 107 
CGI ILE A 107 
CG2 ILE A 107 
CD1 ILE A 107 
N SER A 108 
CA SER A 108 
C SER A 108 
O SER A 108 
CB SER A 108 
OG SER A 108 
N PRO A 109 
CA PRO A 109 
C PRO A 109 
O PRO A 109 
CB PRO A 109 
CG PRO A 109 
CD PRO A 109 
N ASP A 110 
CA ASP A 110 
C ASP A 110 
0 ASP A 110 
CB ASP A 110 
CG ASP A 110 
OD1 ASP A 110 
OD2 ASP A 110 
N GLY A 111 
CA GLY A 111 
GLY A 111 



ATOM 616 O GLY A 111 



59.204 
58.343 
59.190 
58 .446 
58 .283 
57.658 
58.051 
57.440 
57.481 
56.598 
58.507 
56.663 
56.014 
55.324 
55.329 
55.018 
54.749 
55.381 
53.919 
54.736 
54.086 
52.838 
52.726 
54 .976 
55 .070 
54.132 
56.087 
54 .204 
56.169 
55 .231 
55.300 
51 .899 
50.719 
50 .252 
49.835 
49 .614 

48 .498 
50.272 

49 .959 
48 .473 
47 .875 
50.820 
52.193 
50.856 
53 .411 
47 .860 
46 .445 
46.247 
47.165 
45.958 
46.787 
45.062 
44.782 
44.722 
44 .922 

43 .390 
43 .059 
43.907 

44 .391 
44 .352 
45.715 
45.851 
43.227 
43.344 
44.363 
42.466 
46.725 
48.084 
48.338 
49.409 



75.580 
73 .215 
75.918 
72.154 
70.808 
69.826 
68. 671 
70.863 
69.554 
68.714 
69.389 
70.277 
69.397 
70.283 
71.511 
68.470. 
67.177 
66.957 
66.320 
69.656 
70.356 
69.609 
68.403 
70.362 
69. 000 
68.603 
68.106 
67. 347 
66.855 
66.485 
65.245 
70.313 
69.644 
70.331 
71.499 
69 . 625 
68 . 968 
69.567 
70. 124 
70. 005 
68.975 
69.433 
69.181 
70.256 
69.417 
71. 104 
71. 086 
70. 064 
69.818 
72.469 
73. 035 
69.465 
68.440 
68.956 
68.161 
67. 936 
68. 544 
69.734 
70.219 
70.690 
71.225 
71.807 
71 .703 
72.955 
73.103 
73 . 853 
71.007 
71.465 
72.982 
73.417 



17.805 
16.529 
20.873 
21.419 
20.430 
20.332 
22.706 
23.496 
23.352 
24.348 
19.702 
18.765 
17.769 
17.921 
19.477 
18.699 
17.647 
19.064 
16.764 
15 . 687 
15.319 
15.543 
14.448 
13 .774 
12.829 
14.096 
12.215 
13 .483 
12 .542 
11 . 937 
14 .724 
14 . 246 
12.986 
12 .976 
15 .291 
14.757 
11.920 
10.650 
10.454 
10.737 
9.595 
10.252 
8.325 
9.444 
10.053 
9.814 
8.713 
7 . 946 
9 .394 
8.386 
8. 631 
7 . 617 

6 . 185 
5.264 

7 . 981 
9 . 324 
9.501 
5 . 965 
4.590 

4. 196 
3.139 
4.328 
5.182 
5.860 

5. 223 
5. 031 
4.746 
4.684 
4.250 



1.00 61.66 
1.00 61.95 
1.00 61.93 
1.00 58.45 
1.00 57.23 
1.00 56.00 
1 .00 54 .77 
1 .00 57 . 35 
1.00 57 .35 
1.00 55.48 
1.00 56.22 
1 .00 55 . 10 
1.00 54 .76 
1.00 53 .89 
1.00 54.05 
1.00 55.23 
1.00 56.71 
1.00 55.96 
1.00 60.44 
1 .00 52 . 66 
1 .00 52 . 56 
1.00 51.25 
1.00 51.29 
1.00 53.10 
1 .00 55. 91 
1.00 59.33 
1.00 58.18 
1.00 60.78 
1.00 59.77 
1.00 60.60 
1 . 00 60. 13 
1 . 00 49. 68 
1.00 49.10 
1.00 48.51 
1.00 47.69 
1 . 00 48. 76 
1.00 48.08 
1.00 47.91 
1.00 48.19 
1.00 48.08 
1.00 47.65 
1.00 48.45 
1.00 48.77 
1.00 48.58 
1.00 50.54 
1.00 48.44 
1.00 49.14 
1.00 49.85 
1.00 50.02 
1.00 49.19 
1.00 50.05 
1.00 50.74 
l.OO 50.80 
1.00 51.14 
1.00 51.45 
1.00 50.92 
1.00 51.33 
1.00 50.54 
1.00 50.90 
1.00 51.01 
1.00 51.06 
1.00 52.40 
1.00 50.49 
1.00 49.53 
1.00 45.06 
1.00 49.22 
1.00 51.13 
1.00 51.16 
1.00 51 .30 
1.00 51.86 
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ATOM 756 CB HIS A 
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119 57.911 78 

119 55.992 79 

120 56.207 73 
120 55.536 72 
120 55.849 72 
120 57.000 71 
120 55.932 71 
120 55.770 69 
120 56.670 69 
120 54.715 68. 
120 56.542 68 
120 54.573 67 
120 55.489 67, 

120 55.379 66. 

121 54.806 71. 
121 54 .974 71 . 
121 54.123 70. 
121 52.908 70. 
121 54.554 72. 
121 54.784 72. 

121 55.362 73. 

122 54.778 69. 
122 54.099 68. 
122 53.217 68. 
122 53.673 63. 
122 55.092 65. 
122 54.369 65. 
122 55.212 64. 
122 54.469 63. 

122 54.953 61. 

123 51.986 67. 
123 51.137 67. 
123 51.141 65. 
123 52.073 65. 
123 49.705 67. 
123 49.014 67. 
123 47.565 63. 
123 47.118 68. 

123 46.810 67. 
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124 50.126 63. 
124 50.649 62. 
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124 49.988 64. 
124 48.102 65. 
124 50.248 64. 

124 49.286 65. 

125 50.164 61. 
125 50.657 60. 
125 50.379 61 
125 51.212 61 
125 50.103 59 
125 50.796 58 
125 50.153 56 
125 50.045 56 
125 50.984 55 
125 52.124 55 

125 50.780 55 

126 49.219 61 
126 48.922 62 
126 48.746 64 
126 49.039 64 
126 47.651 61 
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40.391 31.239 32.68: 



1.00 27.61 
1.00 28.54 
1.00 30.75 
1.00 30.82 
1.00 34.72 
1 .00 28. 17 
1 . 00 27 . 52 
1 . 00 27 . 45 
1 . 00 26 . 73 
1.00 27.25 
1.00 27.30 
1 . 00 27 . 40 
1 . 00 27 . 28 
1.00 27.42 
1 . 00 28 . 39 
1 . 00 28 . 82 
1 . 00 30 . 44 
1.00 33.24 
1.00 33 . 68 
1 . 00 34 . 06 
1.00 28.05 
1 . 00 28 . 06 
1 . 00 27 . 42 
1 . 00 28 . 10 
1.00 26.84 
1.00 26.81 
1.00 26.26 
1.00 25.67 
1.00 26.87 
1.00 28.61 
1.00 26.14 
1.00 26.40 
1.00 26.15 
1.00 24.92 
1.00 27.27 
1.00 27.73 
1.00 28.58 
1.00 29 . 54 
1.00 28.47 
1.00 27.03 
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N TYR A 661 
CA TYR A 661 
C TYR A 661 
O TYR A 661 
CB TYR A 661 
CG TYR A 661 
CD1 TYR A 661 
CD2 TYR A 661 
CE1 TYR A 661 
CE2 TYR A 661 
CZ TYR A 661 
OH TYR A 661 
N TYR A 6 62 
CA TYR A 6 62 
C TYR A 6 62 
O TYR A 6 62 
CB TYR A 662 
CG TYR A 662 
CD1 TYR A 662 
CD2 TYR A 662 
CE1 TYR A 662 
CE2 TYR A 662 
CZ TYR A 662 
OH TYR A 662 
N ASP A 663 
CA ASP A 6 63 
C ASP A 663 
O ASP A 663 
CB ASP A 663 
CG ASP A 663 
OD1 ASP A 663 
OD2 ASP A 663 
N SER A 6G4 
CA SER A 664 
C SER A 664 
O SER A 664 
CB SER A 664 
OG SER A 664 
N VAL A 665 
CA VAL A 665 
C VAL A 6 65 
O VAL A 665 
CB VAL A 6 65 
CGI VAL A 665 
CG2 VAL A 665 
N TYR A 666 
CA TYR A 666 
C TYR A 666 
0 TYR A 666 
CB TYR A 666 
CG TYR A 666 
CD1 TYR A 666 
CD2 TYR A 6 66 
CE1 TYR A 666 
CE2 TYR A 666 
CZ TYR A 666 
OH TYR A 666 
N THR A 667 
CA THR A 6 67 
C THR A 667 
O THR A 667 
CB THR A 667 
OG1 THR A 667 
CG2 THR A 667 
N GLU A 668 
CA GLU A 668 
C GLU A 668 
O GLU A 668 
CB GLU A 668 
CG GLU A 668 



43 .395 

44 .173 
44.228 
44 .621 
45.611 
45.819 
44 .956 
46.887 
45.166 
47 .100 
46.248 
46.446 
43 .906 
44.091 
42.328 
41.761 
44.657 
45.273 
46.229 
44.853 
46.760 
45.427 
46.370 
46.898 
42 .922 

41 .808 
40.605 
40.719 

42 .203 

42 .439 
41.517 

43 .539 
39.454 
38.199 
38.080 
37 .619 
37 .065 
37 .255 
38.446 
38.176 
38.901 
38.343 
38.510 
38.307 
37.629 
40. 155 
40.910 
40.585 
40.384 
42.382 
43.301 
43 .624 
43.861 
44.493 
44.715 
45.055 
45.894 
40.531 
40.355 
38.998 
38.896 
40.529 
41.790 
40.590 
37.949 
36.603 
36.276 
35.475 
35.569 
35.673 



51 . 579 
52.761 
53 .126 
54 .239 
52 .589 
52 .433 
53 .014 
51.717 
52 .840 
51 . 566 
52.117 
51.935 
52.192 
52.436 
53.020 
53.079 
51.187 

51 .547 
52 .581 
50.960 
52.983 
51 .326 
52 .345 
52 .771 
53 .492 
54 .237 
53 .355 
52.120 
55 .065 
54.223 
53 .513 
54 .229 
54 . 015 
53 .343 

52 .588 
51.473 
54.334 
55.329 
53 .195 
52 .557 
51 . 223 
50.255 
53.485 
52.741 
54 .748 
51 . 172 
49.952 
48 . 956 
47 .791 
50 .267 
49 .105 
48 .336 
48 .776 
47 .296 
47 .730 
47 .009 
45 .942 
49 .419 
48 .504 
47 .836 
46.605 
49 .220 
49.890 
48.214 
48.627 
48.073 
47 .103 
46 .201 
49 .207 
49.996 



48.777 
48.476 
46.967 
46. 651 
48 . 930 
50.422 
51.351 
50.905 
52 . 693 
52.259 
53.144 
54.523 
46.063 
44 . 647 
43.979 
44.599 
43 . 933 
42 . 627 
42 . 568 
41.439 
41.386 
40.236 
40.214 

39 .048 
42 .736 
42 . 199 
41.901 
41.715 

40 .984 
39.732 
39.230 
39.179 
41.894 
41.685 
40.375 
40.397 
41 .743 
40.782 
39.253 
37 .959 
37 .779 
37.338 
36.775 
35.374 
36.854 
38.130 
37 .933 
38.982 
38.677 
38.030 

37 .900 

38 .997 
36 .670 
38 .891 
36 . 545 
37 . 667 
37.574 
40.234 
41 . 354 
41.314 
41.452 
42.736 
42.820 
43.886 
41.147 
41.246 
40.084 
40.206 
41.358 
42.671 



1.00 22.69 
1.00 22.95 
1.00 22.89 
1.00 21.39 
1 . 00 22 . 98 
l.OO 23.13 
1.00 24.47 
1.00 21.26 
1.00 24.70 
1.00 20.68 
1.00 22.08 
1.00 21.51 
1.00 21.57 
1.00 21.52 
1.00 22 .38 
1.00 22.23 
1.00 22 .04 
1.00 21.82 
1.00 22.72 
1.00 21.25 
1.00 21.34 
1.00 25.30 
1.00 23.50 
1. 00 20 .23 
1.00 22.24 
1.00 22.59 
1.00 23.66 
1.00 23.06 
1.00 23.04 
1.00 24.88 
1.00 26.15 
1.00 25.50 
1. 00 23 .69 
1.00 24.34 
1.00 23.92 
1.00 23.78 
1 . 00 24 .41 
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N ALA B 306 
CA ALA B 306 
C ALA B 306 
O ALA B 306 
CB ALA B 306 
N THR B 307 
CA THR B 3 07 
C THR B 307 
O THR B 307 
CB THR B 307 
OG1 THR B 307 
CG2 THR B 307 
N GLN B 308 
CA GLN B 308 
C GLN B 308 
O GLN B 308 
CB GLN B 308 
CG GLN B 308 
CD GLN B 308 
OE1 GLN B 308 
NE2 GLN B 308 
N GLU B 30 9 
CA GLU B 309 
C GLU B 309 
O GLU B 309 
CB GLU B 309 
CG GLU B 3 09 
CD GLU B 309 
OE1 GLU B 309 
OE2 GLU B 309 
N ARG B 310 
CA ARG B 310 
C ARG B 310 
O ARG B 310 
CB ARG B 310 
CG ARG B 310 
CD ARG B 310 
NE ARG B 310 
CZ ARG B 310 
NH1 ARG B 310 
NH2 ARG B 310 
N ILE B 311 
CA ILE B 311 
C ILE B 311 
0 ILE B 311 
CB ILE B 311 
CGI ILE B 311 
CG2 ILE B 311 
CD1 ILE B 311 
N SER B 312 
CA SER B 312 
C SER B 312 
O SER B 312 
CB SER B 312 
OG SER B 312 
N LEU B 313 
CA LEU B 313 
C LEU B 313 
O LEU B 313 
CB LEU B 313 
CG LEU B 313 
CD1 LEU B 313 
CD2 LEU B 313 
N GLN B 314 
CA GLN B 314 
C GLN B 314 
O GLN B 314 
CB GLN B 314 
CG GLN B 314 
CD GLN B 314 



58.217 
59.201 
58.518 
59. 051 
59. 980 
57.365 
56.524 
55.055 
54.780 
56.695 
56.206 
58 .175 
54 . 120 
52 .690 
52 .351 
51.316 
51 .789 
52.147 

51 .937 
51.351 

52 .418 

53 .212 

52 . 907 

53 . 954 

53 .939 

52 .725 
51 .746 
51.386 
52.313 
50.169 

54 .873 
55.907 
56.220 
56 .494 
57.165 

58 .381 
59.598 

59 .478 . 
60.277 

60 .038 

61 .298 
56 .158 

56 .441 
57.663 

57 .848 
55 .290 

53 .962 
55.320 
52.812 
58.482 
59.650 
59.436 
59 .183 
SO .824 
61 .950 
59.540 
59 .361 
60.658 
61.245 

58 .273 
56 .897 
56 .013 
56.267 
61.129 
62.364 
62.066 
61.493 
63.364 
64.795 
65.863 



89.433 

89.737 

89.898 

89.449 

90.949 

90.554 

90.673 

90.588 

90.554 

92.039 

93.080 

92.410 

90.631 

90.608 

91.765 

91.743 

90.732 

89.883 

88.392 

87.991 

87.562 

92.775 

93.966 

94.349 

95.485 

95.188 

94. 973 

96. 277 

97.065 

96.513 

93.443 

93.777 

92.582 

91.501 

94.261 

94.291 

94.806 

96.236 

96.964 

98.258 

96.422 

92.800 

91.777 

92.159 

93 .319 

91 . 632 

91.831 

90.264 

91.743 

91. 162 

91.365 

90.700 

89.489 

90 .703 

91 .004 

91.468 

90.929 

91 . 067 

92 . 154 

91.687 

91 . 696 

92 . 618 

90.298 

89 . 979 

90.044 

89.884 

88.884 

89.018 

89.386 

88.543 



90.864 
91.912 
93 .262 
94.250 
91.575 
93.297 
94 . 506 
94 . 074 
92 . 869 
95 .248 
94 .416 
95.488 
95 . 034 
94 .714 
93 .762 
93 . 107 
95 . 984 
97 . 215 
96.987 
95.972 
97.924 
93 . 688 
92 . 883 
91. 879 
91.363 
93.765 
94 . 891 
95.577 
95.905 
95.748 
91 . 588 
90 . 620 
89 .759 
90 . 272 
91.337 

90 .481 

91 .231 

91 .473 

92 .225 
92 .349 
92 .870 
88 .453 
87 .475 
86.679 
86.298 
86.488 
87 . 171 
85.828 
86 .200 
86.401 
85 .576 
84 .218 
84 . 163 
86 . 242 
85.505 
83 ,139 
81 . 780 



81 . 
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1.953 
81 . 038 
81.697 
80.928 
81 .757 
80.454 
79.708 
78.215 
77 .814 
80.210 
79.861 
80.626 



1 . 00 31. 93 
1.00 32.77 
1.00 33.56 
1.00 35.17 
1.00 32.78 
1.00 34.17 
1 . 00 35 . 64 
1 . 00 36. 27 
1 . 00 36 . 59 
1.00 35.48 
1 . 00 34 . 55 
1 . 00 36 . 51 
1 . 00 36 . 50 
1.00 36.66 
l.OO 36.45 
1 . 00 36 . 23 
1 . 00 37 . 14 
1.00 37.08 
1 . 00 40 . 81 
1 . 00 38 .86 
1.00 37.75 
1.00 36.25 
1.00 36.43 
1.00 35.70 
1 .00 35 .36 
1.00 36.88 
1.00 41.29 
1 .00 46 .84 
1.00 50.25 
1.00 51.11 
1.00 34.99 
1.00 34 .77 
1.00 34.15 
1.00 34.19 
1.00 34.67 
1.00 34.36 
1 .00 37 .03 
1.00 38 . 15 
1 . 00 40 . 61 
1.00 42.04 
1.00 41.05 
1 .00 33 . 58 
1.00 33 . 34 
1 .00 32 .04 
1.00 31.10 
1 . 00 33 .75 
1.00 3S.32 
1 .00 33 .59 
1 . 00 38 .06 
1.00 30.98 
1.00 30.92 
1 .00 30 .94 
1.00 31.01 
1.00 30.36 
1.00 28.13 
1 .00 31 .72 
1 .00 31 . 65 
1 .00 31 .48 
l.OO 31.36 
1 .00 32 .02 
1.00 33.58 
1 .00 33 .88 
1 .00 35 .31 
1.00 30.90 
1.00 31.97 
1.00 31.49 
l.OO 32.91 
1.00 31.34 
l.OO 35.07 
1.00 35.88 
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ATOM 8370 CD1 ILE B 319 

ATOM 8371 N GLN B 32 0 

ATOM 8372 CA GLN B 320 
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65.443 87.481 81.274 1.00 36.84 

62.408 90.904 77.437 1.00 30.58 
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80.331 
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77.893 
76.753 
76.521 
75.928 
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74.876 
74 . 667 
81.268 
82 .347 
83.253 
82 .926 
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80 .293 
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84 .425 
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85.729 
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72 .688 
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90.729 
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91 .458 



1.00 39.82 
1.00 36.94 
1.00 37.65 
1.00 35.26 
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1.00 32.50 
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1.00 28.07 
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1.00 29.14 
1.00 28.78 
1.00 28.53 
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1.00 29.13 
1.00 28.93 
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1.00 28.25 
1.00 28.40 
1.00 27.96 
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1.00 30 .27 
1.00 29.70 
1.00 28.36 
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1.00 29.03 
1.00 29.68 
1.00 28.09 
1. 00 30. 67 
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CD1 PHE B 3 64 
CD2 PHE B 3 64 
CE1 PHE B 364 
CE2 PHE B 3 64 
CZ PHE B 3 64 
N THR B 3 65 
CA THR B 365 
C THR B 365 
O THR B 365 
CB THR B 365 
OG1 THR B 365 
CG2 THR B 365 
N LEU B 3 66 
CA LEU B 366 
C LEU B 366 
O LEU B 3 66 
CB LEU B 3 66 
CG LEU B 3 66 
CD1 LEU B 3 66 
CD2 LEU B 356 
N ASP B 3 57 
CA ASP B 367 
C ASP B 367 
O ASP B 367 
CB ASP B 367 
CG ASP B 367 
OD1 ASP B 367 
OD2 ASP B 367 
N GLY B 368 
CA GLY B 368 
C GLY B 368 
O GLY B 36 8 
N ASN B 369 
CA ASN B 369 
C ASN B 369 
O ASN B 369 
CB ASN B 369 
CG ASN B 369 
OD1 ASN B 369 
ND2 ASN B 369 
N SER B 370 
CA SER B 37 0 
C SER B 37 0 
O SER B 370 
CB SER B 370 
OG SER B 37 0 
N PHE B 371 
CA PHE B 371 
C PHE B 371 
O PHE B 371 
CB PHE B 371 
CG PHE B 371 
CD1 PHE B 371 
CD2 PHE B 371 
CE1 PHE B 371 
CE2 PHE B 371 
CZ PHE B 371 
N TYR B 372 
CA TYR B 372 
C TYR B 372 
O TYR B 372 
CB TYR B 372 
CG TYR B 372 
CD1 TYR B 372 
CD2 TYR B 372 
CE1 TYR B 372 
CE2 TYR B 372 
CZ TYR B 372 
OH TYR B 372 
N LYS B 373 



63 . 142 
63 .483 
63 .050 
63.385 
63.179 
65.498 
.65.501 
64 .051 
63 .203 
66 .180 
65.607 
67 .593 
63 .778 
62 .422 
61.714 
60 .527 
62 .437 
61 . 060 
60 .213 
61 . 191 
62 .415 
61.708 
61.542 
60 . 904 

62 .381 

63 .844 
64.185 

64 .721 
62 .096 
61.976 

62 .787 
62.665 
63.658 
64.459 
65.768 
66.471 
64.692 

63 .565 

62 .763 

63 .460 
66.132 
67 .354 
67 .210 
66.286 
68.521 
68 .386 

68 . 140 
68.122 

69 .476 
70.433 
67 . 163 
67.525 

67 .081 
68.305 
67.366 

68 .589 

68 .126 

69 .560 
70.807 
70.592 

69 .539 
71 .299 

71 .576 

70 .561 

72 .871 
70.812 
73.130 
72.080 
72 .281 
71 .611 



88.993 
90.953 
89.732 
91.681 
91.048 
88.889 
88 .851 
88 .945 
89 .406 
90.081 
91 .202 
90 . 095 
88 .542 
88 .551 
89.899 
89.936 



90.238 
91 .484 
89 . 082 
90.314 
89 . 107 
95. 180 
96.639 
97 . 071 
96.319 
97 . 159 
96 .482 
96 . 767 
98 . 288 
98 .822 
98.692 
9B .435 



88.096 100.286 
87.911 100.930 
86.873 100.145 
87.466 102.408 
91.019 98.842 



92, 303 
92 . 770 
93 .778 
93 . 386 
93.626 
93 .593 
93.849 
92.040 
92.413 
93. 633 
94. 129 
94.091 
95.265 
94.991 
95.905 
96. 083 
96. 993 
97.230 
97.500 
93.730 
93 . 402 
91. 984 
91.264 
93. 509 
92 . 513 
.91. 571 
90.218 
89.724 
90.494 
90. 106 
90.965 
92.277 
90.470 
93.056 
91.241 
92.543 
88 . 422 
87.819 
87.159 
86 . 583 
86 . 800 
87 . 370 
87 .609 
87 . 683 
88.117 
88.184 
88 .414 
88 .915 
87 . 194 



.718 
97 .272 
97 . 025 
99.544 
99 . 136 
97 . 906 
100 . 003 
96.315 
94 . 912 
94 . 521 
93 .412 
95.415 
95.121 
94.482 
94 . 109 
96 . 380 
96 . 553 
95.762 
97.902 
94.381 
93 . 711 
93 .213 
93 . 617 
94.679 
95 . 682 
92 .370 
91 .872 
91 .429 
91 .261 
90 .712 
89 .516 
89 .425 
88 . 493 
88 .336 
87 .380 
87 .299 
91 .244 
90.776 
89.421 
89.167 
91 .791 
93 .136 
94 .014 
93 .539 
95.313 
94.830 
95.693 
96 .950 
88 .572 



1.00 28.84 
1.00 29.07 
1.00 29.83 
1.00 31.45 
1.00 28.43 
1.00 30.02 
1.00 31.47 
1.00 32.11 
1 . 00 32 .79 
1.00 31.27 
1.00 28.45 
1 . 00 33 .17 
1.00 33.65 
1.00 34.69 
1 . 00 34 .24 
1.00 33.60 
l.OO 35.14 
1 . 00 38 .40 
1 . 00 39 .24 
1 . 00 41.28 
1.00 34.28 
1.00 34.37 
1.00 33.70 
1 .00 34.18 
1.00 34.77 
1.00 38.12 
1.00 37.36 
1.00 40.82 
1.00 33.10 
1.00 33.37 
1.00 33.02 
1.00 32.64 
1.00 32.97 
1.00 32.76 
1.00 30.68 
1.00 29.88 
1.00 33.68 
1.00 36.77 
1 .00 40. 15 
1.00 44.89 
1.00 29.62 
l.OO 28.42 
1.00 27.80 
1 .00 28 .01 
1 .00 27 . 82 
1 .00 28.71 
1.00 26.77 
1.00 26.51 
1.00 25.96 
1.00 26.04 
1.00 25.97 
1.00 26.30 
1.00 26.19 
1.00 26.20 
1.00 28.41 
1.00 25.01 
1.00 26.85 
1.00 25.84 
1.00 26.43 
1 .00 27.24 
1.00 27.59 
1.00 26.09 
1.00 25.58 
1.00 25.70 
1.00 27.45 
1.00 25.19 
1.00 27.08 
1.00 28.33 
1.00 28.13 
1.00 27.72 
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69.468 69.698 105.842 1.00 29.39 
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71.614 70.785 105.718 1.00 27.76 
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70.572 73.268 105.094 1.00 31.59 

71.282 73.943 106.272 1.00 33.08 

72.350 74.536 106.096 1.00 32.69 
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69.249 
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72 . 398 
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66 . 038 
65.051 
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67 .889 
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66.227 
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1.00 47.32 
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1. 00 47 .86 
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1.00 53.34 
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1.00 44.87 
1.00 43.55 
1.00 40.89 
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1.00 47.78 
1.00 48.95 
1.00 50.32 
1.00 38.45 
1.00 35.25 
1.00 34.02 
1.00 33.35 
1.00 34.98 
1.00 32.83 
1.00 33.15 
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1.00 34.42 
1.00 33.22 
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1.00 31.17 
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1.00 30.44 
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ATOM 10207 O LEU B 543 

ATOM 10208 CB LEU B 543 

ATOM 102 09 CG LEU B 543 

ATOM 10210 CD1 LEU B 543 

ATOM 10211 CD2 LEU B 543 

ATOM 10212 N LEU B 544 

ATOM 10213 CA LEU B 544 

ATOM 10214 C LEU B 544 

ATOM 10215 O LEU B 544 

ATOM 10216 CB LEU B 544 

ATOM 10217 CG LEU B 544 

ATOM 10218 CD1 LEU B 544 

ATOM 10219 CD2 LEU B 544 

ATOM 10220 N ASP B 545 

ATOM 10221 CA ASP B 545 

ATOM 10222 C ASP B 545 

ATOM 10223 O ASP B 545 

ATOM 10224 CB ASP B 545 

ATOM 10225 CG ASP B 545 

ATOM 10226 OD1 ASP B 545 

ATOM 10227 OD2 ASP B 545 

ATOM 10228 N VAL B 546 

ATOM 10229 CA VAL B 546 

ATOM 10230 C VAL B 546 

ATOM 10231 O VAL B 546 

ATOM 10232 CB VAL B 546 

ATOM 10233 CGI VAL B 546 

ATOM 10234 CG2 VAL B 546 

ATOM 10235 N TYR B 547 

ATOM 10236 CA TYR B 547 

ATOM 10237 C TYR B 547 

ATOM 10238 O TYR B 547 

ATOM 10239 CB TYR B 547 

ATOM 10240 CG TYR B 547 

ATOM 10241 CD1 TYR B 547 

ATOM 10242 CD2 TYR B 547 

ATOM 10243 CE1 TYR B 547 

ATOM 10244 CE2 TYR B 547 

ATOM 10245 CZ TYR B 547 

ATOM 10246 OH TYR B 547 

ATOM 10247 N ALA B 548 

ATOM 10248 CA ALA B 548 

ATOM 10249 C ALA. B 548 

ATOM 10250 O ALA B 54 8 

ATOM 10251 CB ALA B 548 

ATOM 10252 N GLY B 54 9 

ATOM 10253 CA GLY B 549 

ATOM 10254 C GLY B 549 

ATOM 10255 O GLY B 549 

ATOM 10256 N PRO B S50 

ATOM 10257 CA PRO B 550 

ATOM 10258 C PRO B 550 

ATOM 10259 O PRO B 550 

ATOM 10260 CB PRO B 550 

ATOM 10261 CG PRO B 550 

ATOM 10262 CD PRO B 550 

ATOM 10263 N CYS B 551 

ATOM 10264 CA CYS B 551 

ATOM 10265 C CYS B 551 

ATOM 10266 O CYS B 551 

ATOM 102 67 CB CYS B 551 

ATOM 102 68 SG CYS B 551 

ATOM 10269 N SER B 552 

ATOM 10270 CA SER B 552 

ATOM 10271 C SER B 552 

ATOM 10272 O SER B 552 

ATOM 10273 CB SER B 552 

ATOM 10274 OG SER B 552 

ATOM 10275 N GLN B 553 

ATOM 1027 6 CA GLN B 553 
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ATOM 10277 C GLN B 553 

ATOM 10278 O GLN B 553 

ATOM 10279 CB GLN B 553 

ATOM 10280 CG GLN B 553 

ATOM 10281 CD GLN B 553 

ATOM 10282 OE1 GLN B 553 

ATOM 10283 NE2 GLN B 553 

ATOM 10284 N LYS B 554 

ATOM 10285 CA LYS B 554 

ATOM 10286 C LYS B 554 

ATOM 10287 0 LYS B 554 

ATOM 10288 CB LYS B 554 

ATOM 10289 CG LYS B 554 

ATOM 10290 CD LYS B 554 

ATOM 10291 CE LYS B 554 

ATOM 10292 NZ LYS B 554 

ATOM 10293 N ALA B 555 

ATOM 10294 CA ALA B 555 

ATOM 10295 C ALA B 555 

ATOM 10296 O ALA B 555 

ATOM 10297 CB ALA B 555 

ATOM 10298 N ASP B 555 

ATOM 10299 CA ASP B 556 

ATOM 10300 C ASP B 556 

ATOM 10301 O ASP B 556 

ATOM 10302 CB ASP B 556 

ATOM 10303 CG ASP B 556 

ATOM 10304 OD1 ASP B 556 

ATOM 10305 OD2 ASP B 556 

ATOM 10306 N THR B 557 

ATOM 10307 CA THR B 557 

ATOM 10308 C THR B 557 

ATOM 10309 O THR B 557 

ATOM 10310 CB THR B 557 

ATOM 10311 OG1 THR B 557 

ATOM 10312 CG2 THR B 557 

ATOM 10313 N VAL B 558 

ATOM 10314 CA VAL B 558 

ATOM 10315 C VAL B 558 

ATOM 10316 O VAL B 558 

ATOM 10317 CB VAL B 558 

ATOM 10318 CGI VAL B 558 

ATOM 10319 CG2 VAL B 558 

ATOM 10320 N PHE B 559 

ATOM 10321 CA PHE B 559 

ATOM 10322 C PHE B 559 

ATOM 10323 O PHE B 559 

ATOM 10324 CB PHE B 559 

ATOM 10325 CG PHE B 559 

ATOM 10326 CD1 PHE B 559 

ATOM 10327 CD2 PHE B 559 

ATOM 10328 CE1 PHE B 559 

ATOM 10329 CE2 PHE B 559 

ATOM 10330 CZ PHE B 559 

ATOM 103 31 N ARG B 560 

ATOM 10332 CA ARG B 560 

ATOM 10333 C ARG B 560 

ATOM 10334 O ARG B 560 

ATOM 10335 CB ARG B 560 

ATOM 1033 6 CG ARG B 560 

ATOM 10337 CD ARG B 560 

ATOM 10338 NE ARG B 560 

ATOM 10339 CZ ARG B 560 

ATOM 10340 NH1 ARG B 560 

ATOM 10341 NH2 ARG B 560 

ATOM 10342 N LEU B 561 

ATOM 10343 CA LEU B 561 

ATOM 10344 C LEU B 561 

ATOM 1034 5 O LEU B 561 

ATOM 10346 CB LEU B 561 
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1.00 23.55 
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CA SER 
C SER 
O SER 
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N VAL 
CA VAL 
C VAL 
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N PHE 
CA PHE 
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CB PHE 
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B 641 
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B 642 
B 642 
B 642 
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B 642 
B 643 
B 643 
B 643 
B 643 
B 644 
3 644 
B 644 
B 644 
B 644 
B 644 
B 645 
B 645 
B 645 
B 645 
B 646 
B 646 
B 646 
B 646 
B 646 
B 646 
B 646 
B 647 
B 647 
B 647 
B 647 
B 647 
B 647 
B 647 
B 647 
B 647 
B 647 
B 647 
B 648 
B 648 
B 648 
B 648 
B 648 
B 648 
B 648 
B 648 
B €48 
B 649 
B 649 
B 649 
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B 549 
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B 650 
B 65 0 
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B 550 
B 651 
B 651 
B 651 
B 651 
B 651 
B 651 
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B 652 



80.615 
80 .716 
81.948 
82.051 

82 .873 
84.017 
85.170 
85.976 

84 .582 
85.219 
85.259 
86.341 
87.582 
88.597 
87.463 
88.615 
89.333 
90.552 
88.167 
87.585 
88.569 
89.112 
89.207 
89.521 
88 .917 
B9.080 
87.989 
88.235 
B9.111 
89.123 
90 .338 
86 .780 

85 .668 
85.239 
85 .004 
84 .484 
84 .819 
85 .398 

84 .561 

85 .716 

84 .894 

85 .477 
85 .096 

84 .673 

83 .168 
82 .679 

85 .045 

84 .533 

85 .342 
84 .668 
84 .285 
82 .422 
80.966 
80 .289 
80.907 
80.428 
80.676 
79 .015 
78.238 
76.727 
76.193 
76.055 
74.614 
74.102 
74.453 
74 .137 
74 .768 
72 .594 
74.579 
73.254 



64.858 
65.577 
66.473 
67.295 
66.316 
67.187 
66.768 
67. 607 
67.206 
65. 956 
65.483 
64.941 
64.561 
64.215 
64.579 
64. 404 
63.079 
62. 990 
64.519 
63.294 
62.047 
60.707 
59.961 
58.765 
60.647 
60.025 
59.008 
58.021 
61 . 129 
60.546 
62 .054 
59 .217 
58.345 
57 .394 
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59.215 
60 . 173 

61 .391 
59 .863 
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60.754 
61 .944 
56.127 
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54.875 
53.751 
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55.500 
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57.321 
56.973 
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56.507 
54 .338 
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53 .922 
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52 .614 
51.696 
53 .560 
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55.028 
55 .281 
52 .364 
51.565 
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51 .814 

53 .695 
54.595 
54 . 409 

54 .610 

56 .075 
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53 .893 
55.203 

55 .245 
56 .300 
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57 .433 
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57 .988 
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61.392 

60 .817 
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61 .740 
59 .538 



1.00 28.92 
1.00 29.39 
1.00 29.96 
1.00 29.05 
1.00 29.53 
1.00 29.91 
1.00 31.18 
1.00 31.66 
1 .00 29.68 
1.00 28.01 
1.00 32.16 
1.00 33.04 
1.00 33.77 
1.00 33.62 
1.00 34.07 
1.00 34.04 
1.00 34.10 
1.00 34.02 
1.00 33.03 
1.00 31.58 
1.00 34.16 
1.00 33.94 
1.00 34.82 
1.00 35.73 
1.00 34.33 
1.00 33.90 
1.00 33.16 
1.00 32.54 
1.00 34.58 
1.00 34.63 
1.00 34.31 
1.00 32.43 
1.00 32.12 
1.00 31.73 
1.00 32.31 
1 . 00 32 . 01 
1.00 31.30 
1 .00 31.77 
1 . 00 31.08 
1.00 28.09 
1.00 28.81 
1.00 27.39 
1.00 32.24 
1.00 31.64 
1.00 32.05 
1.00 30.31 
1.00 32.05 
1 .00 31.04 
1.00 32.31 
1.00 35.71 
1.00 39.53 
1.00 31.54 
1.00 32.41 
1 . 00 31. 12 
1.00 30.91 
1.00 32.74 
1.00 37.84 
1.00 30.42 
1.00 29.72 
1.00 2B.44 
1.00 27.80 
1.00 26.95 
1.00 25.90 
1.00 25.19 
1.00 24.71 
1.00 25.72 
1.00 24.38 
1.00 26.85 
1.00 26.43 
1.00 24.38 
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ATOM 114 67 CD2 LEU B 701 
ATOM 1146B N LEU B 702 
ATOM 11469 CA LEU B 702 
ATOM 11470 C LEU B 702 
ATOM 11471 O LEU B 702 
ATOM 11472 CB LEU B 702 
ATOM 11473 CG LEU B 702 
ATOM 11474 CD1 LEU B 702 
ATOM 11475 CD2 LEU B 702 
ATOM 11476 N ILE B 703 
ATOM 11477 CA ILE B 703 
ATOM 11478 C ILE B 703 
ATOM 11479 O ILE B 703 
ATOM 11480 CB ILE B 703 
ATOM 11481 CGI ILE B 703 
ATOM 11482 CG2 ILE B 703 
ATOM 11483 CD1 ILE B 703 
ATOM 11484 N HIS B 704 
ATOM 11485 CA HIS B 704 
ATOM 11486 C HIS B 704 
ATOM 11487 O HIS B 704 
ATOM 11488 CB HIS B 704 
ATOM 11489 CG HIS B 704 
ATOM 11490 ND1 HIS B 704 
ATOM 11491 CD 2 HIS B 704 
ATOM 11492 CE1 HIS B 704 
ATOM 11493 NE2 HIS B 704 
ATOM 11494 N GLY B 705 
ATOM 11495 CA GLY B 705 
ATOM 11496 C GLY B 705 
ATOM 11497 O GLY B 705 
ATOM 11498 N THR B 706 
ATOM 11499 CA THR B 706 
ATOM 1150O C THR B 706 
ATOM 11501 O THR B 706 
ATOM 11502 CB THR B 706 
ATOM 11503 OG1 THR B 706 
ATOM 11504 CG2 THR B 706 
ATOM 11505 N ALA B 707 
ATOM 11506 CA ALA B 707 
ATOM 11507 C ALA B 707 
ATOM 11508 O ALA B 707 
ATOM 11509 CB ALA B 707 
ATOM 11510 N ASP B 708 
ATOM 11511 CA ASP B 708 
ATOM 11512 C ASP B 708 
ATOM 11513 O ASP B 708 
ATOM 11514 CB ASP B 708 
ATOM 11515 CG ASP B 708 
ATOM 11516 OD1 ASP B 708 
ATOM 11517 OD2 ASP B 708 
ATOM 11518 N ASP B 709 
ATOM 11519 CA ASP B 709 
ATOM 11520 C ASP B 709 
ATOM 4 11521 O ASP B 709 
ATOM 11522 CB ASP B 709 
ATOM 11523 CG ASP B 709 
ATOM 11524 OD1 ASP B 709 
ATOM 1152 5 OD2 ASP B 709 
ATOM 11526 N ASN B 710 
ATOM 11527 CA ASN B 710 
ATOM 11528 C ASN B 710 
ATOM 11529 O ASN B 710 
ATOM 11530 CB ASN B 710 
ATOM 11531 CG ASN B 710 
ATOM 11532 OD1 ASN B 710 
ATOM 11533 ND2 ASN B 710 
ATOM 11534 N VAL B 711 
ATOM 11535 CA VAL B 711 
ATOM 11536 C VAL B 711 



71.595 53.318 58.502 1.00 22.99 

70.912 57.536 55.820 1.00 23.77 

70.125 58.717 56.064 1.00 23.66 

68.759 58.265 56.525 1.00 22.87 

68.089 57.440 55.863 1.00 22.89 

70.068 59.512 54.769 1.00 23.91 

69.135 60.686 54.696 1.00 23.94 

69.731 61.768 55.572 1.00 26.82 

69.079 61.125 53.298 1.00 25.37 

68.356 58.761 57 . 5B0 1.00 22.92 

67.108 58.346 58.299 1.00 23.25 

66.308 59.532 58.745 1.00 23.23 

66.868 60.463 59.318 1.00 23.16 

67.375 57.497 59.513 1.00 24.17 

68.285 56.354 59.156 1.00 25.71 

66.057 56.981 60.096 1.00 23.96 

68.895 55.693 60.368 1.00 27.51 

64.989 59.498 58.510 1.00 22.56 

64.120 60.615 58.931 1.00 21.90 

62.684 60.148 59.135 1.00 21.79 

62.214 59.265 58.411 1.00 23.20 

64.173 61.714 57.891 1.00 20.48 

64.148 63.084 58.455 1.00 20.84 

65.091 64.034 58.125 1.00 22.71 

63.285 63.687 59.309 1.00 19.35 

64.812 65.163 58.766 1.00 22.05 

63.738 64.969 59.513 1.00 17.00 

61.992 60.719 60.115 1.00 21.89 

60.612 60.378 60.420 1.00 21.24 

59.716 61.273 59.587 1.00 22.48 

60.002 62.462 59.447 1.00 22.15 

58.628 60.740 59.020 1.00 22.26 

57.878 61.557 58.086 1.00 22.34 

56.952 62.542 58.755 1.00 21.18 

56.578 63.486 58.121 1.00 20.70 

57.077 60.724 57.052 1.00 22.35 

56.131 59.951 57.740 1.00 20.72 

57.913 59.659 56.425 1.00 24.38 

56.622 62.369 60.022 1.00 21.54 

55.778 63.352 60.713 1.00 22.57 

56.614 64.271 61.623 1.00 23.06 

56.133 64.673 62.665 1.00 23.66 

54.675 62.642 61.583 1.00 22.37 

57.850 64.573 61.232 1.00 23.56 

58.731 65.467 61.996 1.00 23.27 

58.293 66.920 61.844 1.00 22.78 

58.520 67.556 60.809 1.00 22.16 

60.165 65.284 61.523 1.00 23.23 

61.218 65.711 62.566 1.00 23.33 

60.975 66.664 63.341 1.00 18.66 

62.319 65.109 62.642 1.00 17.00 

57.646 67.420 62.881 1.00 22.06 

57.170 68.793 62.954 1.00 22.26 

58.278 69.756 63.284 1.00 21.89 

58.053 70.941 63.221 1.00 21.79 

56.140 68.959 64.097 1.00 21.75 

56.670 68.443 65.449 1.00 23.45 

56.818 67.219 65.622 1.00 21.03 

5S.958 69.184 66.403 1.00 24.68 

59.432 69.235 63.702 1.00 22.00 

60.568 70.042 64.178 1.00 21.88 

61.607 70.273 63.084 1.00 20.54 

61.738 71.383 62.555 1.00 20.05 

61.128 69.354 65.446 1.00 21.90 

62.126 70.223 66.259 1.00 24.00 

62.401 69.902 67.439 1.00 30.50 

62.686 71.259 65.656 1.00 20.43 

62.377 69.244 62.775 1.00 20.79 

63.293 69.282 61.648 1.00 21.00 

62.507 68.571 60.563 1.00 21.12 
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Column 2 lists 
Column 3 lists 
the element. 
Column 4 lists 
Column 5 lists 



a number for the atom in the structure. 

the element whose coordinates are measured. The first letter in the column defines 

the type of amino acid. 

a number for the amino acid in the structure. 
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Columns 6-8 list the crystallographic coordinates X, Y, and Z respectively. The crystallographic 
coordinates define the atomic position of the element measured. 

Column 9 lists an occupancy factor that refers to the fraction of the molecules in which each 

atom occupies the position specified by the coordinates. A value of "1" indicates that each atom 

has the same conformation, i. e., the same position, in all molecules of the crystal. 

Column 10 lists a thermal factor "B" that measures movement of the atom around its atomic 

center. 
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Sequence Listing 

<110> F.Hof fmann-LaRoche AG 

<120> Crystal structure of DPP-IV and its use 

<130> Case 21491 

<160> 2 

<170> Patentln version 3.1 
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<211> 2211 

<212> DMA 
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<221> misc_f eature 

<223> cDNA for aa 31 to 766 of DPP-IV 
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<210> 2 

<211> 736 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> extracellular domain of DPP-IV 

<222> (1)..(736) 

<223> 

<400> 2 

Gly Thr Asp Asp Ala Thr Ala Asp Ser Arg Lys Thr Tyr Thr Leu Thr 
1 5 10 15 

Asp Tyr Leu Lys Asn Thr Tyr Arg Leu Lys Leu Tyr ser Leu Arg Trp 
20 25 30 

He Ser As? His Glu Tyr Leu Tyr Lys Gin Glu Asn Asn He Leu Val 
35 40 45 

Phe Asn Ala Glu Tyr Gly Asn Ser Ser Val Phe Leu Glu Asn Ser Thr 
50 55 60 

Phe Asp Glu Phe Gly His Ser He Asn Asp Tyr Ser He Ser Pro Asp 
65 70 75 80 

Gly Gin Phe He Leu Leu Glu Tyr Asn Tyr Val Lys Gin Trp Arg His 
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Lys He Glu Pro Asn Leu Pro Ser Tyr Arg He Thr Trp Thr Gly Lys 
145 150 155 160 



Leu Tyr Tyr He Ser Asn Glu Tyr Lys Gly Mec Pro Gly Gly Arg Asn 
385 390 395 400 
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Val Ala Ser Phe Asp Gly Arg Gly Ser Gly Tyr Gin Gly Asp Lys lie 
545 550 555 560 

Met His Ala lie Asn Arg Arg Leu Gly Thr Phe Glu Val Glu Asp Gin 
565 570 575 

lie Glu Ala Ala Arg Gin Phe Ser Lys Met Gly Phe Val Asp Asn Lys 
580 585 590 

Arg lie Ala lie Trp Gly Trp Ser Tyr Gly Gly Tyr Val Thr Ser Met 
595 600 605 

Val Leu Gly Ser Gly Ser Gly Val Phe Lys Cys Gly lie Ala Val Ala 
610 615 620 



Met Gly Leu Pro Thr Pro Glu Asp Asn Leu Asp His Tyr Arg Asn Ser 
645 650 655 



lie His Gly Thr Ala Asp Asp Asn Val His Phe Gin Gin Ser Ala Gin 
675 680 685 

He Ser Lys Ala Leu Val Asp Val Gly Val Asp Phe Gin Ala Met Trp 
690 695 700 

Tyr Thr Asp Glu Asp His Gly He Ala Ser Ser Thr Ala His Gin His 
705 710 715 720 



Claims 

1 . A crystal of the extracellular domain of mammalian DPP-IV. 

2. The crystal of claim 1 , characterized as having an orthorhombic space group of P2 1 2 1 2 1 and one homodimer of 
DPP-IV in the asymmetric unit. 

3. The crystal of claims 1 and 2, wherein the crystal has unit cell dimensions of: 

a is from 63 A to 70 A; 
b is from 66 A to 70 A; 
c is from 41 6 A to 424 A; 

and a P2 1 2 1 2 1 symmetry. 

4. The crystal of claims 1 to 3, characterized by the atomic structure coordinates of Table 4. 

5. A co-crystal of the extracellular domain of mammalian DPP-IV and a ligand bound to its active site. 

6. The crystal of claim 5, characterized as having an orthorhombic space group of P2 1 2 1 2 1 and one homodimer of 
DPP-IV in the asymmetric unit. 

7. The co-crystal of claim 6, wherein the co-crystal has unit cell dimensions of: 

a is from 63 A to 70 A; 
b is from 66 A to 70 A; 
c is from 41 6 A to 424 A; 



and a P2 1 2 1 2 1 symmetry. 



EP 1 422 293 A1 



8. A co-crystal of the extracellular domain of mammalian DPP-IV and a ligand bound to an allosteric binding site. 

9. A co-crystal of the extracellular domain of mammalian DPP-IV and HgCI 2 . 

10. A method for crystallizing mammalian DPP-IV, the method comprising 

(a) providing a buffered, aqueous solution of pH 7 to 8.5 with a concentration of 7 mg/ml to 22 mg/ml of the 
extracellular domain of mammalian DPP-IV; and 

(b) growing crystals by vapor diffusion using a buffered reservoir solution with between 10% and 30% PEG, 
between 10% and 20% glycerol, wherein PEG has an average molecular weight between 1000 and 20000. 

11 . The method according to claim 1 0, wherein the extracellular domain of mammalian DPP-IV of step (a) is produced 
in P. pastoris and then deglycosylated. 

12. A method for co-crystallizing mammalian DPP-IV and an active site ligand, the method comprising 

(a) providing a buffered, aqueous solution of pH 7 to 8.5 with a concentration of 7 mg/ml to 22 mg/ml of the 
extracellular domain of mammalian DPP-IV; 

(b) adding a molar excess of the active site ligand to the aqueous solution of mammalian DPP-IV; 

(c) growing crystals by vapor diffusion using a buffered reservoir solution with between 10% and 30% PEG, 
between 10% and 20% glycerol, wherein PEG has an average molecular weight between 1000 and 20000. 

13. The method according to claim 12, wherein the extracellular domain of mammalian DPP-IV of step (a) is produced 
in P. pastoris and then deglycosylated. 

14. A crystal produced by the methods according to claims 1 0 to 1 3. 

15. A method for determining the three-dimensional structure of a crystallized extracellular domain of mammalian 
DPP-IV to a resolution of 3.5A to 2.1 A or better, the method comprising 

(a) crystallizing an extracellular domain of mammalian DPP-IV; and 

(b) analysing the extracellular domain of mammalian DPP-IV by X-ray diffraction to determine the three-di- 
mensional structure of the crystallized extracellular domain of mammalian DPP-IV, whereby the three-dimen- 
sional structure of a crystallized extracellular domain of mammalian DPP-IV is determined to a resolution of 
3.5A to 2.1 A or better. 

16. A machine-readable data storage medium comprising a data storage material encoded with machine readable 
data which, when using a machine programmed with instructions for using said data, displays a graphical three- 
dimensional representation of a molecule or molecular complex comprising at least a portion of the extracellular 
domain of mammalian DPP-IV comprising the amino acids of SEQ ID NO:2, the extracellular domain comprising 
the ligand binding active site being defined by a set of points having a root mean square deviation of less than 
about 1 5A from points representing the backbone atoms of said amino acids as represented by structure coordi- 
nates listed in Table 4. 

17. A method for identifying a compound that interacts with DPP-IV, comprising the steps of 

(a) generating a three-dimensional model of DPP-IV using the structure coordinates listed in Table 4, a root 
mean square deviation from the backbone atoms of said amino acids of less than 1 .5A; and 

(b) employing said three-dimensional model to design or select a compound that interacts with DPP-IV. 

18. The method according to claim 1 7, further comprising the steps of 

(c) obtaining the identified compound; and 

(d) contacting the obtained compound with DPP-IV in order to determine the effect the compound has on 
DPP-IV activity. 
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19. The method according to claims 17 and 18, wherein the compound interacts with the active site of DPP-IV. 

20. The method according to claims 1 7 and 1 8, wherein the compound interacts with an allosteric binding site of DPP-IV. 

21. The method according to claims 17 and 20, wherein the compound is an inhibitor of DPP-IV activity. 

22. The method according to claims 1 7 to 21 , wherein the method is a computer-assisted method. 

23. A compound identified by the methods according to claims 1 7 to 22. 

24. A pharmaceutical composition comprising the compound of claim 23 and a pharmaceutical^ acceptable carrier. 

25. A compound according to claim 23 for use as a therapeutic active substance, in particular for the treatment of 
diabetes type I, diabetes type II, IGT, obesity and cancer. 

26. An isolated nucleic acid sequence encoding the soluble extracellular domain of DPP-IV comprising the nucleotide 
sequence of SEQ ID NO:1 . 

27. A nucleic acid construct comprising an expression vector and the nucleic acid sequence according to claim 26. 

28. A host cell transformed with the nucleic acid construct according to claim 27. 

29. A method of producing the soluble extracellular domain of DPP-IV comprising culturing the host cell of claim 28 
under conditions permitting the expression of the soluble extracellular domain of DPP-IV by the host cell. 

30. The method according to claim 29, wherein the host cell is P. pastoris. 

31. A polypeptide comprising the soluble extracellular domain of DPP-IV as set forth in SEQ ID NO:2. 

32. Use of a compound according to claim 23 for the manufacture of a medicament for the treatment of diabetes type- 
I, diabetes type-ll, IGT, obesity and cancer. 

33. Use of a crystal or a co-crystal according to claims 1 to 9 for the identification and/or design of inhibitors of DPP-IV 
activity. 

34. The novel crystals, methods, compounds, compositions and uses substantially as herein before described espe- 
cially with reference to the foregoing Examples. 
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Figure 3B 




BNSDOCID: <EP 1422293A1_I_= 



209 



EP 1 422 293 A1 




EP 1 422 293 A1 



Figure 4 
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